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EDITORIAL 


_ THESIS ABSTRACTS 


Since 1938, this Journal has regularly published abstracts of appropriate 
theses which have been accepted for higher qualifications in psychology and 
education. In recent years, however, in spite of the increased output of theses, 
these have come mainly from a very small number of universities. This note is 
included to remind Heads of Departments of Psychology and of Institutes and 
Departments of Education of the service which is available. 


Thesis abstracts for consideration should be forwarded to the Editor by 
the Head of the Department concerned and may be intended ejther as the sole 
published account of the thesis or as a preliminary report of a research which is 
intended to be published at length at some future date. Itis éssential that they 
should be abstracts, not short articles, and Heads of Departments are specially 
asked to sub-edit them to make them as clear as possible consistent with 
brevity. Normally, abstracts of more than 1,500 words will not be considered 
and only exceptionally will tables be allowed. > ` 

Proofs will normally be returned to the Department of origin for correction. 


+ 


CORRECTION e 


The heading of the three papers on researches using the Maudsley Person- 
ality Inventories in the last issue of the Journal (p. 241) should have been 
“ Three Experiments using Maudsley Personality Inventories ” not “ Children’s 
Personality Inventories.” Savage used the M.P.I. in his research. 


Dr. Warburton has also pointed out that the statement by Callard and 
Goodfellow that the J.M.P.I. is the only test of this kind available for English 
children, is not strictly true. Cattell’s High School Personality Questionnaire 
(H.S.P.Q.) for ages 12-16 and his Children’s Personality Questionnaire (C.P.Q.) 
for ages 8-12, are available in this country. Although they were standardised in 
the United States they have been used in Manchester and have been found 
suitable for use with English children. 
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A FURTHER STUDY OF THE EDUCATIONAL PROGRESS 
OF CHILDREN WHO HAD RECEIVED REMEDIAL 
EDUCATION 


By K. LOVELL, C. BYRNE anp B. RICHARDSON 
(University of Leeds) 


Summary. Children who had received full-time teaching in Remedial Centres 
of an English education authority were traced, and their reading ages assessed 
along with their understanding of certain arithmetic concepts. A Summary 
of the data obtained is given together with the inter-relationships found between 
a number of the relevant variables. 


I.—INTRODUCTION. 


In an earlier paper (Lovell, ef al. 1962) some details were given relating to the 
reading ages of children who had received remedial teaching and were subse- 
quently followed-up. The pupils concerned in that study had received one 
period of remedial teaching per week, each period lasting 45 minutes. In the 
present study junior school children in a different education authority, who had 
been sent to full-time remedial centres for a period, were followed-up and their 
reading ages assessed. Furthermore, these children also undertook a test of 
arithmetic concepts at follow-up. 


It must be clearly stated at the outset that this investigation throws no 
light on whether the children who attended these centres became better adjusted 
to school work and life, or whether their general motivation and morale 
improved under full-time remedial teaching. But the study does throw light 
on the likely educational progress and eventual performance, during school 
years, of children considered to be in need of remedial teaching. Furthermore, 
it is now possible to compare the progress of these children with those in the 
earlier study who had remedial teaching for only one period a week, and it has 
been possible to throw some further light on the question as to whether, in the 
long run, children who receive remedial teaching in special centres make more 
progress than those who stay in their own schools. 


II.—SAMPLE STUDIED AND PROCEDURE. 


The local education authority had five full-time remedial centres and the 
teachers-in-charge visited the local junior schools and picked out children, 
using intelligence and reading tests, who were considered to be in need of 
remedial teaching. Not all children so picked out attended the centres. Some- 
times, the head of a junior school did not regard a particular child as suitable, 
and some parents refused permission for their children to attend the centres in 
lieu of school. In general, those who attended the centres were deemed to be 
the most likely to make most improvement under special teaching. 

An examination of the records at the centres yielded complete data on 261 
children who had attended full-time, and who had returned to ordinary schools 
not later than July, 1960. The follow-up study was done during the period 
November, 1961—March, 1962, so that no child was included in the survey 
unless he had been back at ordinary school for at least sixteen months. 
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The subjects were then divided up into those who were backward and those 
who were not, the former group being defined as those pupils whose reading ages 
(on whatever test was used at the time of referral) were less than 85 per cent. of 
their chronological ages. There was thus found to be 184 children classified as 
backward, and 77 as not backward. It will be quite obvious at this point that 
some children had been selected for attendance at these centres on the grounds 
of a discrepancy between reading age and mental age. Of the backward children, 
nine had moved to other parts of the country, one was in hospital, four were 
absent from school dwing the follow-up period, and six could not be traced by 
the authority, giving 164 backward children available for study. Of the non- 
backward group only one child was not found (she had moved to a private 
school outside the authority’s area) giving 76 pupils in this category available 
for investigation. 

The teachers-in-charge of the centres were free to use intelligence or reading 
tests (often the N.F.E.R. Sentence Reading Test) of thei own choice in selecting 
children for the centres. Of the backward children fifty-nine had taken the 
Moray House Picture Test 1, fifty-four the Intelligence Tests for young children 
(Valentine), twenty-five the Sleight Non-Verbal Intelligence Test, fourteen the 

JF.E.R. Non-Verbal Test 5, and twelve the Terman-Merrill ievision of the 
Binet test. Again, either just before or immediately after admission, 142 subjects 
had taken the Schonell Graded Vocabulary Test R1 and twenty-two the Holborn 
Reading Scale for Infants and Juniors ; while at the time of return to school, 
102 had been given the Schonell Graded Vocabulary Test R1, and sixty-two the 
Holborn Reading Scale. 

In the non-backward group forty-eight had taken the Moray House Picture 
Test 1, eleven the Intelligence Tests for Young Children (Valentine), three the 
Sleight Non-Verbal test, thirteen the N.F.E.R. Non-Verbal Test 5, and one the 
Terman-Merrill revision of the Binet test. Just before or immediately after 
admission to the centres, sixty-two children had their reading ages assessed 
using the Schonell Graded Vocabulary Test RI and fourteen the Holborn 
Reading Scale, while on return to school the numbers for these two tests were 
respectively fifty-seven and nineteen. 


On follow-up, the children were found in over fifty schools. Both the 
Schonell Graded Vocabulary Test R1, and the Vernon Graded Word Reading 
Test were given to assess the reading ages. Moreover, each pupil undertook, 
individually, a test which involved the understanding of certain arithmetic 
concepts devised by one of the authors (K.L.). The questions, which were 
duplicated were presented to the subjects. The tester read aloud a question, 
and the child followed on the paper. The latter could either answer orally or in 
writing. No time limit was imposed. When the tester was satisfied that the 
child had answered the question to the best of his ability the experimenter 
passed on to the next question. The percentage numbers of children respond- 
ing correctly to some of the least well answered questions are shown in Table 4. 


In order to have a further check on educational progress, figures will be 
presented showing the numbers of children in the bottom stream, next from 
bottom stream, etc., of multi-stream schools, on follow-up (Table 5). 


At one centre initial data were obtained on fifty children who, on the basis 
of a discrepancy between R.A. and M.A., had been regarded as being suitable for 
attendance at the centre, but who, for the kind of reasons already explained, did 
not, in fact, attend. Such children were left in ordinary schools and will be 
referred to as the control group. All children who were picked as likely to be 
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suitable for this particular centre had taken the N.F.E.R. Sentence Reading 
Test ; in addition, other reading tests were given to children when they actually 
joined the centre. The control group was matched with pupils who went to the 
centre for chronological age, reading quotient (N.F.E.R. Sentence Reading 
Test), sex, and, as far as possible, intelligence test score and school ; and this 
matched group which attended the centre will be referred to as the experimental 
group. While the matching procedure was not perfect, it can be said with 
certainty that the teachers of the control group had no idea whatever, that 
some years later these children would be followed up and retested. Moreover, 
these fifty children in the experimental group were drawn from fifteen schools 
so that the classes out of which they were taken were but little reduced in 


size. At follow-up, then, the control group was also traced and the two reading 
tests given. See Table 6. 


II.—RESsULTS. 


The data obtained have been given in tables for the sake of clarity. 


TABLE 1 


Dara RELATING TO THE PERIOD OF REMEDIAL TEACHING. 


Variable | Backward Group Non-Backward Group 
N=76 


LQ. M-=98-8, o —12- M=111-3,0 
R.A, (in years) at time of referral . | M= 8-2,0 
C.A. (in years) at time of referral.. | 1 9-2, M= 8-7,0 
j i RA | 3 
fe 7 5 pase ) 
Mean Reading Quotient ex ) 94 
R.A, (in years) at end of remedial 


teaching M= 8-4, 0 =1-4 M=10-0, 0o =1-3 
C.A. (in years) at end of remedial 


M=10-4, 0 =1-0 M= 96,0= ‘8 
8 104 


Mean improvement in R.A. during 
remedial teaching (in years)... . 

Extremes (in years) of improve- 
ment during remedial teaching 


Educational Progress after Remedial Education 


TABLE 2 


Data RELATING TO THE PERIOD SINCE REMEDIAL TEACHING CEASED. 


Variable 


Backward 
N=164 


Non-Backward 


R.A. (in years) at follow-up 
(Schonell R1) 

R.A, (in years) at follow-up 
(Vernon Graded Word) 

C.A. (in years) at follow-up 

Reading Quotient (R.A. from 
Schonell R1) 


Mean progress (in years) since 
remedial teaching ceased 

Mean length of time in years 
between end of remedial teach- 
ing and follow-up 


Extremes of Progress made (in 


years) since remedial teaching 
ended : 


12-23 months previously .... 


24-35 months Previously .... 


36-47 months previously .... 


48+ months previously .. 


4 


M=10-0, c =1-8 


M= 9-8, o =2-0 
M=13-1,¢ =1-6 


76 
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DISTRIBUTION OF READING AGES (SCHONELL R.1) at ForLow-upr. 


Reading Agein Years 


55 65 75 85 9.5 10-5 11-5 12-5 13-5 


G4 7-4 g4 9-4 10-4 11-4 


Non-Backward | ry 0 y 


Backward 


12-4 13-4 14-4 


24 28 26 
1 4 6 
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TABLE 4 


PERCENTAGE NUMBERS OF CHILDREN GIVING CORRECT REPLIES TO SELECTED QUESTIONS 
IN THE TEST OF ARITHMETIC CONCEPTS. 


Question Backward Non-Backward 


1.—In the number 26 what does the figure 6 stand 
for (worth) ? Or, What does it tell us ? 37 
2.—In the number 26 what does the figure 2 stand 


for ? Or, What does it tell us ? 


3.—In the number 404, what does the middle figure 
0 tell you ? 


actually multiplying the numbers : 


3 9x57 divided by 9 ......... 0.00... cece 13 22 
6a.—Explain how you obtained the answer you have 
just given (in 6). Or, How did you get that? .. 12 18 
a ee 


TABLE 5 


PERCENTAGE NUMBERS OF CHILDREN IN GIVEN STREAMS OF Mucti-STREAM PRIMARY, 
SECONDARY MODERN oR ALL AGE SCHOOL aT FOLLOW-UP. 


"Multi-Stream School 


Next to 
Single Bottom Bottom Top Inter- 
Stream Stream Stream in Stream mediate 
Scheol 3 or more Stream 

Stream 

School 


Backward .. 18 
Non-Backward 13 


TABLE 6 
Data RELATING TO THE EXPERIMENTAL AND CONTROL Groups—50 IN EACH GROUP. 


Experimental Group Control Group 
M=116, o =10-0 M=112, g =10-6 


M=7°-9 M=7°9 


M=85 M=85 
M=0-9 


R.A. (in years) at follow-u 
(Schonell R1) . M=10-7, o =1°2 M=10°8, o =1:3 

R.A. (in years) 
(Vernon) ++ | M=10-5, o =1-5 M=10:7, o =1-5 

C.A. (in years) at follow-up. .....| M=11:3 M=11-3 

Reading Quotient (Schonell R1). . 94 96 

Mean length of period (in years) 
since the experimental group 
first joined the Centre M=3-4 M=3-4 
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TABLE 7 


INTER-RELATIONSHIPS BETWEEN RELEVANT VARIABLES USING KENDALL’s Tau Co- 
EFFICIENT. 


Relationship Bacleward Non vana 
roup roup 


-22*% -15* 
-27** 10 


-Q)e* 


eaching holding 
aching 


*—Indicates significance at 5 per cent. level. 
**— Indicates significance at 1 percent. level. 
t—It is not possible to test the significance of Txy.z 


IV.—Discussion. 


The results may be conveniently discussed under a number of 


(1) The data relating to the backward group (which has a mean 1.0. close 
to 100) during the period of remedial teaching shows much the same general 
trends as the corresponding data in the earlier study (Lovell eż al. op cit.). There 
is a rise in the reading quotients of the same order of magnitude in both studies, 
and overall, we may say that as far as reading ages are concerned, the children 
in the earlier study, who had had remedial teaching for only one period a week, 
made equally as much Progress per unit time as those now under discussion. 


Furthermore, the progress made in reading per unit time since remedial teaching 
ceased is almost identical in both investigations. 


(2) There is tendency in the previous investigation for the reading quotients 
to fall between the end of treatment and follow-up. The same trend is clearly 
noticeable in both backward and non-backward groups in this study. 


headings : 


at 81-9 years of age, wh 
Not a single child of seco: 
school at follow-up. 


o are later found among the academically abler pupils. 
ndary school age in this group was found in a grammar 
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(5) The findings given in Table 6 came somewhat as a surprise. After a 
mean period of almost 3} years from the time of referral, no difference in the 
mean reading ages could be found between those who did, and those who did not 
attend the remedial centres. While the matching procedure could not be perfect, 
it was done with great care. Moreover, those who went to the centres were 
regarded as the most promising pupils. It is still possible, of course, that up to 
one year after referral, the mean reading age of those attending the centres 
might have been ahead of the mean reading age of those in school. 


(6) The non-backward group (mean I.Q. 111) made about the same 
Progress per unit time in reading as the backward group during the period of 
remedial teaching. The reading quotient rose but fell markedly again in the 
period after remedial teaching ended so that at follow-up the quotient was 
roughly the same as at the time of referral. 

(7) In the arithmetic test some two-thirds of the non-backward group 
understood place value in relation to tens and units, but the mote difficult 
questions relating to hundreds, tens and units remain poorly answered. 


(8) In spite of the high mean I.Q. of the non-backward group, nearly half 
of them were found in the bottom stream, or next to bottom stream of a three or 
more stream school at follow-up. One in five of the group was found in a single 
stream school, and roughly one in seven in the top stream of a multi-stream 
school. But of those of secondary school age at follow-up, not a single child 
was found in a grammar school. 


(9) It must be stressed that although the progress made by the majority 
of the children involved in this study was limited, the progress made was of 
great importance for the pupils concerned. To attain a reading age of 9+ years 
is a very real progress for the individual. 


(10) The correlation coefficient between I.Q. and progress made during 
treatment (Table 7) is small but significant in the backward and non-backward 
group. This clearly parallels the finding in the earlier study. 

(11) The previous study and this one, which together involved the follow-up 
of five hundred children, compared children taught singly, and in small groups, 
for one period per week, with children of somewhat higher I.Q. taught full-time 
in remedial centres. Whatever arrangements we have, the long-term results 
seem to be much the same. This surely suggests that we know very little as 
yet about the aetiology of scholastic failure and indicates the great need for 
research, 


ACKNOWLEDGEMENTS.—Our thanks are due to the Chief Education Officer 
and the Senior Educational Psychologist of the Local Education Authority 
concerned for their generous help in conducting the enquiry and for permission 
to publish our findings. 
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DEVELOPMENTAL PATTERNS OF SEVERELY SUB- 
NORMAL CHILDREN 


By MARY WOODWARD* anp DIANA J. STERN} 
(Fountain Hospital, London) 


Summary. LEighty-three severely subnormal children aged under 9 years were 
observed with the aim of comparing their development in different areas and of 
examining the relation of various aspects of development to Piaget’s sensori- 
motor stages. It was found that locomotor development was significantly in 
advance of speech development. Children classified at various sensori-motor 
stages differed significantly in speech and in locomotor development. Certain 
developments in language, drawing, performance ability and social responses 
were associated with the attainment of the last sensori-motor stage, thus 
suggesting that the achievement of the end of the sensori-motor period is an 
important step in the development of severely subnormal children. 


I.—IntTRODUCTION. 


A PREVIOUS study (Woodward, 1959) investigated a method of classifying the 
severely subnormal in terms of the six stages of sensori-motor development 
which have been distinguished by Piaget (1953 and 1955). The results indicated 
that severely subnormal childien pass through these steps in the same sequence 
as Piaget suggests for normal children up to the age of two years. The aim of 
the present enquiry was (i) to determine the locomotor, language and social 
development of children classified at these sensori-motor stages, and (ii) to 
examine patterns of development in young severely subnormal children. 

The main theoretical interest attaches to the comparison of language 
development and the Piaget stages. At the last sensori-motor stage, children 
solve problems in such a way as to suggest that they are able to anticipate the 
future position of an object before they move it, and they persistently search 
for hidden objects in a manner which suggests that they are aware of the perm- 
anence of objects. According to Piaget (1950), this behaviour indicates that the 
child has some means of representing objects to himself in their absence, and 
this implies the development of ‘a symbolic function’ which makes possible 
the systematic acquisition of language. This suggests the hypothesis that 
severely subnormal children will make a similar development in language at the 
end of the sensori-motor period. 

II.—SUBJECTS. 

The subjects were resident at the Fountain Hospital, London. The total 
hospital population born during 1951 or later was selected in the first instance 
as the group to be investigated. Children were then 
if they had any condition, revealed in the medical examination, such as 
cerebral palsy, gross hydrocephalus, or a visual or auditory defect, that is likely 
to retard any aspect of development. In addition, 
over 50 were omitted from the study. This left a 
During the following eighteen months twenty-nine 


to the group ; these were consecutive cases wit 
This yielded a total 


{ hs (mean age=4-85 years). Three 
-Merrill 1.Qs. of 42, 41 and 31 ; the rest failed one or more 


hi ie °. . 5 
below the lower limit of the ph s scale (forty-nine failed all six), thus falling 


* Now at University College, Swansea. 
tł Now at Child Guidance Clinic, Bexleyheath, Kent. 
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II].—MeEtHop. 


The main classification was made in terms of Piaget’s six sensori-motor 
stages. Since these stages have been described in detail in the earlier study 
already referred to, only the main criteria are given below. 


Stage I (birth to 4/5 weeks). 
Use and extension of reflex activities. 


Stage I] (1 month to 4/5 months). i Ad, 
Integration of reflex activities with higher activities, e.g., co-ordination 
of vision and hearing and vision and grasping. 


Stage III (4/5 months to 8/9 months). 

Repetition in the same form of chance-made movements that produced an 
effect on objects that the child saw or heard (e.g., shaking a rattle). Following 
the line of fall when an object is dropped and recovering half-hidden objects. 


Stage IV (8/9 months to 11/12 months). 

At the beginning of the stage, removing an obstacle in order to grasp a toy 
which is visible: at the end, removing an obstacle when the object is hidden 
but searching only in one place. Visual examination of objects by turning them 
round and round and holding them at various distances. 


Stage V (11/12 months to approximately 18 months). 

Obtaining an object that is out of reach by pulling a string attached to it, 
or the support on which it stands ; searching briefly ir different places when an 
object is hidden. Exploratory manipulations which vary the effect on objects, 
e.g., banging a toy successively on different-sounding surfaces. 


Stage VI (approximately 18 months onwards). 

_ _ Inferring the result of an action before the object is moved, e.g., using a 
stick in order to obtain a toy that is out of reach ; searching for objects for a 
longer period (i.e., awareness of the permanence of objects is acheived). 

More advanced manipulation was investigated by the following performance 
tasks from the Revised Stanford-Binet (Terman and Merrill, 1937) and Merrill- 
Palmer (Stutsman, 1931) scales of intelligence : 

Four-brick tower. 

Three-figure form board. 
Three-figure form board rotated. 
Wall in peg boards A and B. 
Block bridge. 

Bead threading. 

Seguin form board. 

Little pink tower. 

Six-cube pyramid. 

If a child did not succeed with any of the first four tasks, the rest were not 

ven. 

It was subsequently found necessary to derive a performance score for each 
child. One point was awarded for each all-or-none task that was passed, accord- 
ing to Merrill-Palmer and Stanford-Binet criteria of success ; timed tasks (e.g., 
peg boards and Seguin form board) were given weighted scores on the basis of 
speed. Refusals were allowed for as in the Merrill-Palmer method of scoring 
(Stutsman, 1931). 


12 Developmental Patterns of Sub-normal Children 


The subjects were in the phase of development which, in normal children, 
would be assessed by means of developmental scales. These were used in the 
present enquiry as a basis for the assessment of speech and locomotor develop- 
ment, but experience of administering such scales to severely subnormal childien 
suggested certain modifications. 


The locomotor section of Gesell’s schedule (Gesell, 1940 and 1941) was 
used, with the exception of items for which special equipment, such as walking 
boards, was needed. Gesell’s instructions for administration were followed. 
In cases of doubt additional information was obtained from nursing staff. 
At the more advanced levels, 2} years onwards, some items (e.g., standing on 
one foot) require the child to imitate the examiner. The co-operation of with- 
drawn children could not be obtained for these items, and teaching and nursing 


staff could not supply information on them. Thus, these imitative items were 
omitted from the scoring for all children. 


The language sections of developmental scales include, in addition to 
vocalisation and comprehension items, others relating to response to sound, 
manipulation of a bell and social responses. When emotionally disturbed 
children obtain low scores on language scales because they fail to respond to 
sounds, toys and people, the result does not accurately describe their language 
development. In this study only items of vocalisation and comprehension 
were included. Severely subnormal children frequently show scatter on a 
language scale, verbal comprehension being relatively more advanced than 
vocalisations. Thus, these two aspects of language were treated separately. 


For speech development the vocalisation items were abstracted from Gesell’s 
scale, together with the age norms assigned to them by him. Scores in verms 
of‘ speech ages ’ were based on these norms. Records were made of the type of 
vocalisations uttered by the child during the examination, and further informa- 
tion on habitual sounds was obtained from nursing and teaching staff. Some 
children made no othei sounds except grunts and squeaks, and, although these 
occur as items in Gesell’s scale, speech ages were not assigned on the basis of 
these alone, so that these children were omitted from the analysis of the results 


on speech. The amount of the child’s spontaneous vocalisation during the 
examination was rated on a three-point scale. 


The following items were used for the investigation of comprehension : 


Response to a gestured instruction. 

Response to simple commands in a concrete situation (e.g., “ Shut the 
door, sit down, stand up”). 

Comprehension of ‘give’ in relation to a real common object (e.g., 
cup, spoon, ball). 

Comprehension of ‘ show ’ in relation to a real common object. 

Response to the question: “ Would you like a sweet? ” 

Response to the statement : “ There’s a sweet in the box.” 


e simple commands item No. 6 of year II of the Terman-Merrill 
scale (Form L). 


Data collected on Social development included the child’s social response to 
the examiner and to a familiar nurse, his response to difficulty, and any attempts 
anon or at drawing the examiner’s attention to his activities. 
ae ation o e achievement of feeding skills was obtained from nursing 
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IV.—RESULTs. 
(a) Performance ability. 


The number of children falling in each of Piaget’s sensori-motor stages is 
shown in Table 1, together with the aveiage age of the children in each stage. 


TABLE 1 


SENSORI-MOTOR STAGE AND LOCOMOTOR AGE. 


Locomotor age in years 


Sensori-motor Mean = 
Stage a C.A. (yrs.) Highest point 
Basal age |————————- 
Mean Range 


0-38—0-77 
0-54—2-0 
0-77—3-0 
0:94—4-0 


Twenty-five of the thirty-four Stage VI children succeeded with two or 
more of the performance tasks, their scores ranging from 1 to 21, but no child 
below Stage VI succeeded with any of them, a difference which is significant at 
beyond the 0:001 level (x°=35-4, d.f.=1). Of the nine Stage VI children who 
failed all these items, five showed signs of severe behaviour disorder in the form 
of visual avoidance, or prolonged distress. The other four were the youngest 
at this stage, aged 3 years to 4 years 4 months. 


(0) Locomotor development. 


Two locomotor ages for each child derived from the Gesell scale as follows : 
(1) The age level at which the child succeeded with all the items (basal age). 
(2) The highest point at which a success was achieved. 


The locomotor scores of children in each sensori-motor stage are shown in 
Table 1. From this it can be seen that the average locomotor age increases with 
successive sensori-motor stages, but there is a wide range at each stage. Never- 
theless, the difference between the means for the highest point score of Stages 
V and VI are statistically significant (t=8-42, p<0-001, d.f.=61). In view of 
the small numbers in Stages III and IV and of the small differences between 
the means, these two stages were combined ; the resulting average is significant- 
ly lower than the Stage V average (t=4-75, p <0-001, d.f.=47). In no instance 
was a child’s locomotor age (highest point) level with his chronological age. 


A difference between the basal age and the highest point occurred in thirty- 
six cases. Among the children who could not walk without support there was a 
consistent pattern in the type of item which accounted for the scatter. The 
details are given in Table 2 for the twenty-one children whose upper limit 
(highest point) was below one year. 
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TABLE 2 


ANALYSIS OF ITEMS PASSED AND FAILED BY TWENTY-ONE CHILDREN SHOWING SCATTER ON 
THE GESELL LOCOMOTOR SCALE. 


Type of item passed (+-) and 
passed (—) at highest point 
Highest point of — -———| Range of scatter between 
locomotor develop- . Standing last standing item + and 
“ment Sitting items items last sitting item + * 


8 to 24 weeks 
12 to 28 weeks 
4 to 8 weeks 
No standing items at 
these ages. 


* Age level at which last success achieved for a sitting item subtracted from age level at 
which last success is achieved for a standing item. 


It will be noted that in nineteen cases the success at the highest point was a 
sitting item, and in one case only it was a standing item, and that a sitting item 
was failed at the highest point in only one case. Standing items, on the other 
hand, were failed at the highest point by fifteen of the sixteen children whose 
highest point was thirty-two weeks and over (there are no standing items below 
the age of thirty-two weeks). Thus, the scatter in the majority of cases was due 
to failure to show various standing responses over a range of the scale at which 
sitting responses were present. New admissions, who had recently come from 
home did not differ significantly from those who were already resident ia hospital 


either in locomotor score or in the relative number showing a scatter over the 
locomotor scale. 


Locomotor score (highest point) was positively correlated with performance 
score among the Stage VI children (rho=0-68, p=0-001). Hence, this factor 
was controlled in group comparisons, as was chronological age also, though a 
significant correlation between age and locomotor score (highest point) within 


any stage was found only for the group at Stage VI with performance scores of 
0 (tho=0-625), 


_ In view of the tendency of Mongols to lower muscle tone (Hilliard and 
Kirman, 1957), their locomotor development was compared with that of non- 
Mongols. Twenty-three Pairs were matched for age, Piaget stage and perform- 
ance score, the average ages of the two groups being 4-96 years and 5-025 years, 
respectively. The mean locomotor age (highest point) of the Mongols which was 
1-76 years (S.D.=0-98) differed very little from that of the non-Mongols 
(M=1-77, S.D.=0-9 ). The basal age means were similarly close (1-63 years 
and 1-64 years). 
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(c) Speech development. 


In fifteea cases no sounds (apart from squeaks, crying and laughing) were 
heard during the examination, and the nursing staff similarly reported having 
heard no other sounds from these children. The ‘ speech age ’ of the remaining 
sixty-eight children is shown in Table 3. 


TABLE 3 


Locomotor AND SPEECH DEVELOPMENT, 


Locomotor age Speech Age 

Sensori-motor Highest point 
Stage Mean Mean Range 
(years) (years) 


0-487 0-23—0-77 
0-403 0-15—0-69 
0-784 0-54—1 -25 
1-196 0-62—2-0 


The first interesting feature of this table is the wide range in speech age at 
any Piaget stage ; the second is the increase in the upper limit of speech develop- 
ment with successive stages. At Stages III and IV the most complex sound 
made by any child was a four-syllable babble ; jargon, interwoven with single 
words, was the most advanced development at Stage V. Two Stage V children 
produced two-word combinations in an echolalic manner, but meaningful and 
relevant word combinations and short sentences occurred only in the speech of 
some of the Stage VI children. 


The difference between the means of the Stage V and Stage VI children is 
statistically significant at beyond the 0-001 level (t=3-92 for 52 d.f.). The 
Stage V average is also significantly different from the combined mean of 
Stages IV and III (t=3-52, 0-01> p> 0-001, d.f.=33). 


In view of the mutism that is sometimes associated with severe behaviour 
disorders, it was postulated that the absence of speech sounds, or very retarded 
speech development, would be related to behaviour disturbance in children 
at Stages V and VI, among whom absence of speech could scarcely be attributed 
merely to extremely slow development. Very retarded speech was defined as 
up to and including twenty-eight weeks (vowels only) for Stage V and up to and 
including thirty-six weeks (two-syllable babble) for Stage VI. Behaviour 
disorder was defined by marked visual avoidance or prolonged distress when 
approached, or mannerisms which warded off social contact. Eight of the 
eighteen children at Stages V and VI who were extremely retarded in speech 
development, or who made no sounds, showed signs of behaviour disorder, 
compared with three of the remaining forty-five children at these stages ; this 
difference is significant between the 0-01 and 0-001 levels (x?=10-25, with Yate’s 
correction applied, d.f.=1). 


Among the children at Stage VI, speech development was positively 
correlated with performance score when the four children with severe behaviour 
disorders were omitted (rho=0-38, p=0-05). Rank order correlations of age 
and speech development within each Piaget stage and performance score group 
Were not significantly different from zero in any instance. 
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The bearing of the period of residence in hospital on speech development 
was examined by comparing the speech development of twenty-three new 
admissions and twenty-three others matched for age, Piaget stage and perform- 
ance score where applicable. The average speech ages of the two groups were 
0-83 years and 0-81 years, respectively. (When new admissions were excluded, 
there were too few children left at any stage for length of stay in hospital to be 
correlated with speech age.) 


Twenty-two pairs of Mongols and non-Mongols were matched for age, 
Piaget stage and performance score, A mean difference of 0-126 years in speech 
age, favouring the Mongols, was found, but there were six children with severe 
behaviour disorders in the non-Mongol gioup and non among the Mongols. 
Excluding these six children and their Mongol pairs, the mean speech ages for 
the remaining sixteen pairs were 1-028 years (S.D.=0-405) for the Mongols and 
0-896 (S.D.=0-5) for the non-Mongols ; this difference was not statistically 
significant. 

A comparison of speech and locomotor development in the same children 
within each Piaget stage is made in Table 3, excluding the children who made no 


speech sounds. It will be seen that, for all groups, the average locomotor age is 
considerably in advance of the average speech age. 


For the whole group of sixty-eight children, the average speech age (0-92 
years) was significantly lower than the average locomotor age (1:82 years) for the 
highest point score. (t for correlated means=9-82, p <0-001, d.f.=67). 


In fifty-five of the sixty-eight cases (81 per cent.), both the highest point of 
locomotor development and the basal age were higher than the speech age, In 
four cases one or both was the same as the speech age ; both locomotor ages 


were lower than the speech age in four cases, and in the remaining five cases 
only the basal age was lower. 


(4) Verbal comprehension. 


None of the children classified at Stages III and IV showed any under- 
standing of speech, nor did twenty-five of those at Stage V, nine of whom 
responded to instructions given by gesture. The remaining four Stage V 
children gave evidence of verbal comprehension in simple concrete situations. 
All the children classified at Stage VI, on the other hand, showed evidence of 
some verbal comprehension. Fourteen of them responded only at the level of 
simple concrete situations ; of these nine failed the other items listed in Section 
III, but the other five were too disturbed for these to be administered. The 
remaining twenty children succeeded with three or more of the items listed in 
Section III (including concrete situations and excluding gestures). Comparing 
Stages V and VI for the number of children in three categories of verbal com- 
prehension (1) success with three or more items, (2) verbal comprehension in 


concrete situations only, and (3) none at all, the differences are significant at 
beyond the 0-001 level (y2=50-36, d.f.=2). 


The relation between verbal comprehension and Piaget’s sensori-motor 
stages was further investigated by an examination of clinical records, which 
contained the Piaget stage and the results of attempts to administer the 
Terman-Merrill revision of the Binet scale. Children who succeeded with one or 
more of the year II verbal items were invariably at sensori-motor Stage VI. 
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(e) Drawing. 


The drawing ability of children at the various Piaget stages is shown in 
Table 4. 


TABLE 4 


SENSORI-MOTOR STAGE AND DRAWING. 


Sensori-motor stage 


Drawing 


< 
< 
pa 


Faint marks or dots . 
Straight scribble 
Circular scribble . . 
Copy of lines 

Copy of circle, partial* 
Copy of circle, correct 
Copy of Cross 


LTT} ess 


SKLHAGDKWNWS 


to 
© 
o 
i) 


* By circular scribble. 


It will be seen that the use of a pencil begins to occur at Stage V, but 
extends to vigorous scribbling in only two cases. All but three of the Stage VI 
children, on the other hand, scribbled freely or copied models of straight lines 
and circles. (One refused to handle the pencil.) The differences between Stages 
V and VI, in three drawing categories ((1) none, (2) marks or dots, and (3) 
scribbles or copies) are significant at beyond the 0-001 level (x*=28+1, d.f.=2). 


(f) Feeding skills. 


Feeding skills achieved by children at various Piaget stages are shown in 
Table 5. 
TABLE 5 


SENSORI-MOTOR STAGE AND FEEDING SKILLS. 


Sensori-motor Stage 


Iv 


Does not drink 

Drinks if cup held 

Helps to hold CUP _ is: 
Holds cup without help . 


Does not feed self with fingers 
Feeds self with fingers 


Spoon: Does not use 
Uses and spills... 
Uses without spilling 


Total in each stage 
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(g) Social responses. 


All the children in the group showed awareness of the investigator and a 
nurse or teacher, most by looking at the investigator when she approached ; 
the social awareness of some was indicated in a negative way by visual avoid- 
ance. Others were distressed in the presence of the investigator, or began a 
mannerism or slapped their faces when approached. 

The response of smiling was found at all stages, though not all the children 
in a given stage showed it ; the incidence was naturally affected by the behaviour 
described in the previous paragraph. Few children made a vocal response 
when talked to by the investigator, but spontaneous vocalisations occurred in 
more cases. A rating of | (few or no spontaneous vocalisations) compared with 
a rating of 2 or 3, was given to significantly more of the children who showed 
signs of behaviour disorder than to the rest (y*=20-02, p. <0-001, d.f.=2.). 

Differences in the amount of vocalisation (assessed by the ratings) between 
new admissions and children already resident in hospital were not statistically 
significant. 

Communication, in terms of imparting information or making a demand, 
was observed mainly among the Stage VI children, twenty-two of the thirty- 
four pointing for this purpose, with or without a vocal accompaniment. Only 
two Stage V children communicated, and none below this level did so. , 

Other social responses were observed only ‘among the Stage V I children. 
Seventeen (50 per cent.) of them drew the investigator’s attention to their 
activities, or kept looking to make sure the investigator was watching their 


performance ; and twelve children sought the investigator’s help when mm 
difficulty. 


V.—DIscussION AND CONCLUSIONS. 


There are two main questions arising from these findings that call for 
discussion. One concerns the pattern of development in severely subnormal 
children, and the other is the relation of other aspects of development to Piaget's 
sensori-motor stages. 

The group is too deviant for locomotor and speech scores to be equated in 
terms of standard scores (in any case Gesell does not provide the necessary data 
for this). The expression of the results in terms of ‘locomotor ages’ and 
“speech ages’ is probably inexact, and consequently comparisons of rates of 
development in different aspects are approximate. Nevertheless, the difference 
between the locomotor score and the speech score of the whole group is so large 
that it would appear that the severely subnormal, although grossly retarded in 
all areas of development, tend to develop much more rapidly in the locomotor 
sphere than they do in the acquisition of speech. If the approximate ages 
suggested by Piaget for the sensori-motor stages are accepted, then at all stages 
(with the possible exception of III) locomotor development appears to be in 
advance of the development of sensori-motor intelligence, which, in turn, is in 
advance of the development of speech. 

The twin study of Gesell and Thompson (1929) indicates that locomotor 
development is mainly influenced by maturation, since intensive training made 
little difference to subsequent locomotor scores. Much of Piaget’s account of 
the development of sensori-motor intelligence is a description of a learning 
process, and learning obviously enters into the acquisition of speech. Thus, it 
can be postulated from these results that the severely subnormal develop 
relatively more rapidly in the aspect which is mainly maturational, and that 
they develop relatively more slowly in aspects which depend more on learning- 
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Another finding on locomotor development which calls for further investiga- 
tion is that of a large scatter over the locomotor scale by the majority of the 
children who could not walk. This suggests that steps in the sequence of 
behaviour that lead to the attainment of the upright posture do not, in the 
severely subnormal, correspond with steps in the development of the sitting 
posture as in normal children. One possible psychological factor accounting for 
this is a difference in the amount of individual attention and encouragement to 
stand received by children in hospital and at home. New admissions did not, 
however, differ significantly from the children who had been resident in the 
hospital for a longer period, either in locomotor score, or in scatter over the 
locomotor scale. Although children with cerebral palsy were excluded from 
the study, it is, of course, possible that there were motor lesions which were not 
observable clinically. Comparison of these data with post-mortem findings 
would be valuable. 

The limited verbal stimulation of children in hospital might also be put 
forward to account for the finding of greater retardation in speech development, 
relative to locomotor development and the sensori-motor stages. The results of 
a number of studies (reviewed by McCarthy, 1954) have indicated that normal 
children living in institutions develop language more slowly than children living 
at home. A similar finding in respect of severely subnormal children has been 
reported by Lyle (1960). However, in the present study new admissions did not 
airtfer in average speech score from children already resident in hospital. This 
finding is not necessarily at variance with Lyle’s, since he investigated older 
children whose language was more advanced. They could identify and name 
objects, whereas the present study was mainly of pre-verbal utterances. It is 
thus possible that different types of environment become more important once 
words are beginning to be systematically acquired. The studies of normal 
children, however, give conflicting results as to the age at which an unstimulating 
environment begins to affect language development, some reporting an effect 
on phonemes in the first few months and others finding a difference only after 
the age of 2 years (McCarthy, 1954). The hypothesis that an unstimulating 
environment influences language development only after the beginning of the 
systematic use of speech might also account for the different results in respect 
of Mongols in this study and in Lyle’s. Lyle (1960) found that Mongols, whether 
living in hospital or at home, were more backward in language than comparable 
non-Mongols. The Mongols of the present group did not differ significantly 
from non-Mongols. ' 

Further study might usefully be made of the findings of the present study 
that children with severe behaviour disorders tended to be very retarded in 
speech development or to make no sounds. This result also indicates that 
behaviour disorder is a variable that should be controlled in comparative studies 
of language development. 

The second main question to be discussed is whether children classified into 
Piaget’s sensori-motor stages are clearly differentiated in other aspects of 
behaviour. For most aspects of development there is little difference between 
Stages III and IV. This is probably to be explained by the fact that all the 
children at Stage IV were only at the beginning of this stage. This appears to 
be a transitional stage, and very few of the severely subnormal children who 
have been observed have been at the end of the stage. Children are classified 
at the beginning of Stage IV if they brush aside a transparent screen in order to 
obtain a toy that it covers, though in the extent of their awareness of th 
anence of objects they are the same as those at Stage III (reco 
hidden object, but not a completely hidden one). 


| the perm- 
vering a half- 
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However, significant differences in both locomotor and speech development 
were found between successive stages (III and IV together compared with V, 
and V compared with VI). Children at Stage VI were significantly differentiated 
from those at Stage V in the development of verbal comprehension. It is also 
of particular interest that it is only Stage VI children who can succeed with, or 
even comprehend, the first verbal items of the Revised Stanford-Binet scale. 
For these it is necessary to identify objects by their names and to carry out 
simple instructions about objects. This finding supports Piaget’s contention 
that the achievement of Stage VI, which involves a concept of the permanence of 
objects, is necessary for the acquisition of meaningful speech. Some children 
in the present group who were at Stage VI did not use meaningful speech, or 
identify objects by name ; whether these only lacked sufficient stimulation, or 
whether another factor is involved, as a matter for further investigation. 


It was also only Stage VI children who succeeded with any performance 
items drawn from pre-school scales. Thus, the attainment of Piaget's sensori- 
motor Stage VI, at any rate in the severely subnormal, appears to be necessary 
for success with the earliest items, both verbal and non-verbal, of pre-school 
scales which are applicable to normal children from the age of 14 to 2 years. 


Results for other aspects of behaviour also underline the importance of the 
attainment of the end of the sensori-motor period in the development of severely 
subnormal children ; only Stage VI children were able to copy lines and a 
circle, and with two exceptions vigorous scribbling occurred only at this stage. 
Communication began to appear at Stage V but was common only at Stage VI, 
and it was only Stage VI children who sought help in difficulty, and who 
indicated that they wished their activities and achievements to be noticed by an 
adult. 


To turn to the practical implications of the findings, the first application is in 
clinical practice. All the Stage VI children in this study showed evidence of 
some verbal comprehension. Hence, if a child at Stage VI is found to have no 


verbal comprehension, then further investigation is indicated, particularly of 
the possibility of deafness. 


The results of this study also suggest when it is useful for nursing staff to 
spend time encouraging a‘child to use a cup and spoon. The findings indicate 
that children who have reaghed Stage’ VI can certainly be taught to feed 
themselves, and they also stgfest that‘it is worthwhile to persist with Stage V 


children. However, the attempt is probably fruitless until the child reaches 
these two stages. 


In the educational field the results indicate that children who have attained 
the last sensori-motor stage would benefit from attending a training centre, 
since new developments in language, drawing and performance ability occur 
during this stage, which appears from this study to be a major event in the 
development of severely subnormal children. 
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ARE RURAL CHILDREN HANDICAPPED BY THE USE 
OF SPEEDED TESTS IN SELECTION PROCEDURES? 


By 
C. ANNE MORETON 
(Charlotte Mason Training College) 
AND 
H. J. BUTCHER 
(Department of Education, University of Manchester) 


Summary. Itis widely believed in country areas that the country child is at a 
disadvantage in the working of speeded tests. This hypothesis was tested by 
the comparison of two samples, each of about 300 children aged 10-11, one from 
rural Westmorland, one from central Manchester. Power and speed scores were 
obtained from N.F.E.R. tests of Arithmetic, English and Verbal Reasoning. 
Almost all net differences were statistically significant in the expected direction, 


tending to confirm the hypothesis, but were not sufficiently large to suggest a 
need for revision of selection procedures. 


I.—Inrropuction. 
In the course both of teaching in rural schools and a 
practice in a rural area, one of the authors has fre 
from teachers and parents that the country chil 
working of speeded tests used i 
often attributed to the sl 
that in urban areas. 


Zhe present research is confined to testing the hypothesis that such a 
disadvantage exists, without any detailed exploration of possible reasons or 
correlates. But it is believed to be the first systematic enquiry of its kind and 


may be timely in view of recent renewed interest in the concepts of speed and 
power in mental tasks (e.g., Furneaux, 1960). 


lso of supervising school 
quently met with complaints 
dis ata disadvantage in the 
n selection procedures. This disadvantage is 
ower tempo of life in the country as compared with 


i on, and a pure test of power impracticable. 
Furneaux (1960) is probably correct in holding that such concepts as ‘ speed,’ 


Ba .accuracy’ and ‘ difficulty’ require a new and more operational 
definition if conflicting results are to be avoided. However, certain trends— 


one might almost Say fashions—in research do z r to emerge. 
In the earliest rès ment i 


; carches there was disagreement between those who main- 
tained that Speed of reaction was positively correlated with higher mental 
Processes (Gilbert, 1894, aguley, 1901) and those who found no such relation- 
ship (Binet, 1900 ; Wissler, 1901 ; Whipple, 1904). As a result of further research 
(Burt, 1909 ; Brown, 1910; Kruger and Spearman, 1914; May, 1921; Bern- 
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stein, 1924; Spearman, 1927), the view that such a relationship does exist 
became generally accepted, although the relative importance and predictive 
power of these two components of mental ability is still not entirely settled. 


Since about 1930, attempts have been made, foreshadowed by Bernstein 
(1924), to interpret speed of mental work as a personality factor (e.g., Kennedy, 
1930 ; Himmelweit, 1946). In contrast, Eysenck (1953), discussing the work of 
Furneaux, has maintained that speed is the fundamental component of ability, 
an that measures of power, which depend on persistence, involve personality 
actors. 


Broadly speaking, however, the orthodox view appears still to be, firstly 
that speed and power are highly correlated, secondly that power is an important 
component, if not the more important component of general ability, thirdly, 
that tests, as used for example in the 11-}- selection procedure, achieve an 
adequate compromise. Thus, Peel (1956), commenting on the kind of criticims 
that gave rise to the present investigation, concludes both that in general at 
11+ “ the abler child is the speedier child,” and that in any case the time-limit 
imposed on most group tests in practice is not so stringent as to make speed an 
over-riding factor. 


Among psychologists as well as among the general public there are some 
who might still query this view (Porebski, 1954; Myers, 1952, 1960; Mollen- 
kopf, 1960). Myers, for example, writes : “ In timed selection and achievement 
tests, we may be defining the purposes of education as including, perhaps 
emphasising, the teaching of the value of snap judgment and educated guessing, 
over the value of deliberate careful thought.” 


In view of the continuing interest in these questions, it is mildly surprising 
to find how little research has been done into the social correlates of power and 
speed. Klineberg (1928), Lambeth and Lanier (1933) and Knapp (1960) 
investigated racial differences, but urban-rural differences appear to be relatively 
unexplored. The most relevant study known to the writers is that of Smith 
(1948), whose conclusions about Hebridean children include the following : 


“Tt is clear in these circumstances that the time factor has a profound 
effect on test results over and above the normal effect. A rigidly timed test 
does not seem to discriminate very well in the range of the intelligence scale, 
because mental processes which are more slowly initiated and pursued than 
is usual are constantly being interrupted and suspended by the imposed speed 
of the examiner . . . There is little evidence of the modern tendency to race with 
time. It is rightly considered infinite and unlikely to interfere with the expected 
sequence of things ; so in the normal course of events there is no need for undue 
haste. Probably if the children had grasped more fully the significance of the 
stop-watch in the experiment, the scores would have been more flattering to 
them.” 


Smith’s research was primarily concerned with bilingual children, and 
there can be few rural areas comparable in isolation with the Outer Hebrides, 
yet it seems reasonable to postulate a gradation of natural tempo between such 
children and urban children. The aim of the present research is to determine 
whether English rural children in a relatively isolated area are handicapped to 
some degree in a similar way. 
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III.—CHOICE OF SAMPLES. 


(a) The Manchester sample. l 

The Manchester sample of 297 was drawn from six schools in a lower 
working-class area near the centre of the city. As will be seen from Table 1, 
the largest number from any one school was 74 and the smallest number 36. 
In the total sample there were 153 boys and 144 girls. The average age was 
10°11. 


TABLE 1 


School No. 1 was a junior mixed school on the edge of a rapidly deteriorating 
area and, according to the headmistress, has over the past five years lost a 
number of good working-class families who have moved out to one of the newer 
estates. Two coloured boys were concerned in the tests, a half-caste and a 
Nigerian ; many coloured families are now invading this part of Manchester. 
There were several ‘problem’ families, but on the other hand, still many 
families who had lived in the district for one or two generations, 


School No. 2 was a junior and infant school. Again, there was some 
evidence of deterioration, both to Property and living standards in this area, 
although on the whole, the school had maintained its standards. There were 
several coloured children in the school who seemed to be at least of average 
ability and who also seemed to fit in very well with the community. 


School No. 3 was a junior school. It was in a similar type of area to that of 
School No. 2, although here there was more evidence of deterioration of 


property and living standards. The coloured children did not appear to be of 
the same calibre as those in the previous school. 


School No. 4 was a junior school. This school was in rather a better resi- 
dential working-class area but the population of the school was a little mixed 
and there was some slight evidence of a deterioration in the background and 
living standards of the children over the past few years, although actually the 
proportion of those selected for grammar schools was above average, 

School No. 5 was what is 
large floating population and 
houses. There are about thirty coloured children in the school. It is interesting 
to note that at this school a coloured boy who took no extra time for any of the 
ore and quotient and also the highest 
Manchester sample in Arithmetic. 
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School No. 6 was the nearest to the centre of the city. Some of the children 
were from very good working-class homes with interested parents, others were 
from poor homes both economically and socially. There was a small proportion 
of coloured children in the school and also a variety of nationalities (e.g., 
Spaniards, Nigerians and Indians) with attendant language difficulties. Con- 
sidering the prevailing living conditions, the number going forward to either 
grammar or technical schools or grammar-stream in secondary modern schools 
was relatively high. 


(b) The Westmorland Sample. 


The Westmorland sample of 274 was drawn from all the rural schools in the 
county except two which had to be withdrawn owing to administrative difficult- 
ies. The largest number from any one school was twelve and the smallest 
number one. There were 142 boys and 132 girls in this sample, with an average 
age of 10-8. The total number of schools involved was fifty-eight, the largest 
number on roll being ninety-six, and the smallest six. There were twenty-five 
one-teacher schools, twenty-seven two-teacher schools, five three-teacher 
schools and one four-teacher school. It is probably true to say that the 
majority of the children in these schools were the children of agricultural 
workers, although there is a slight but steady trend towards an increasing 
proportion of children whose fathers work in the towns and travel out each day. 
The majority of such parents are office workers and/or members of the pro- 
fessions. 


Considerable care was taken to confine the sample to genuinely rural areas, 
by the omission of even small towns and of areas that are fast becoming 
urbanised. The number of old Westmorland family names gives support to the 
assumption of a strong ‘native’ element. There are for example, thirteen of 
these known to the writers, all of which occur on an average four times. It is 
not claimed, however, that the two samples are equated for socio-economic 
status, and it is probable that the Westmorland sample contained a higher 
proportion of children from middle-class homes. 


IV.—MEtHop. 
(a) The tests used and their administration. 


The tests used were the NFER Arithmetic Test 8, English Test 8, and 
Verbal Reasoning Test 8a. Considerable practical difficulties were envisaged in 
conducting a programme of testing over a wide rural area with poor communica- 
tions and a high proportion of one-teacher schools, in which such a programme 
can easily disrupt the whole school. 


The solution adopted was that the Westmorland results should be based on 
the Authority's preliminary testing, with some additional instructions to 
supervisors, and that the same tests should be administered to the Manchester 
children under as closely similar conditions as possible (but without, of course, 
forming part of the Manchester Authority’s official programme of testing). 


Such a solution is not ideal, and might even introduce differences in absolute 
level of performance, but it was considered unlikely seriously to disturb the 
relative differences between speed and power which form the main topic of 
investigation, 


Stand. | Raw 
Score | Score 
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Instructions to Supervisors. _ nd l 
In addition to the usual instructions for administration issued with the 


tests as to strict time limit, supervision and marking, the following instructions 
were sent to each school : 


Additional Insiructions to Supervisors 
Arithmetic Test 8 (50 Minutes), English Test 8 (50 Minutes) 
Verbal Reasoning Test 8a (45 Minutes) 
(1) After each test has been in progress for 10 minutes, instruct the testees to 
place an ‘X?’ in the margin. 
(2) Record time taken by each testee to complete test within the official time- 


limit. This to be recorded on list containing names and ages (in years and 
months). 


(3) At the earliest convenient time 
to have back their tests if the: 
red pencil. (N.B. 
completed.) 


(b) Types of Score desired. n 
The ten-minute score is referred to as a Time-Limit Score following 
Davidson and Carroll’s (1945) definition as “ the number of items correctly 
answered within a prescribed time-limit.” The score achieved on the normal 
working of the test, i.e., on the first day and within the ‘ official ’ time is referred 
to as the Speed Score. The distinction between ‘ time-limit score ’ and ‘ speed 
score ’ is thus one of degree rather than of kind. Finally, the total score includ- 
ing any red pencil entries, is called the Power Score. Thus, the experiment was 


designed to give nine scores in all for each child, consisting of three types of 
score on each of three tests. 


(c) Method of standardisation, 


___ Tn order to test our hypothesis, it was necessary to make power scores, 
time-limit scores and speed scores all comparable. Conversion tables 
incorporating an age allowance and giving quotients were available in the case 
of speed scores, but not, of course, in the case of the other two types of score, 
which were peculiar to the experiment. 3 

The power and time-limit Scores were, therefore, scaled to the same mean 
and standard deviation as the speed scores for the combined sample. It was 
then possible to convert these into quotients comparable with the speed 


quotients. A section of one of the conversion tables used is reproduced in 
Table 2. 


(possibly the following day) allow testees 
y are not complete, and to complete them in 
—All items omitted on the previous occasion must be 


TABLE 2 
SECTION OF CONVERSION TABLE (ARITHMETIC TEST). 


(Ages 10 years—10-11 years) 
T 


Age in years and completed months at time of test 


—1-723 
—1-678 
—1:571 
—1-588 
—1-544 


| 10-0 | 10-1 | 10-2 | 10-3 | 10-4 | 10-5 | 10-6 | 10-7 | 10-8 [10-9 10-10 | 10-11 
79 | 78 


84 


3 75 | 74 
86 


5 | 


— ———————s ce 
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V.—RESULTS. 


(a) A direct comparison of the two samples on speed, power and time-limit scores. 

Table 3 summarises the comparative performance of the two groups on 
the three tests and under the three conditions of work. The power and time- 
limit scores have been made comparable with the speed scores by the procedure 
already described, which also incorporates an age allowance : 


TABLE 3 


MEANS AND STANDARD DEVIATIONS—CONVERTED SCORES. 


ARITHMETIC 


MANCHESTER WESTMORLAND 
Level of 
Means S.D. Means SD. Signif. 


Speed 105-32 13-49 | 104-83 12-21 N/S 
Power 104-42 | 12-9 | 104-55 | 12-99 N/S 
Time-limit ....| 106-29 13-71 | 103-17 13-33 1% 


ENGLISH 


MANCHESTER WESTMORLAND 


103-94 13-47 | 101-82 11-14 
103-36 13-11 102-18 12-04 
104-16 13-19 | 101-87 12-02 


VERBAL REASONING 


MANCHESTER WESTMORLAND 


Speed 104-44 12-40 | 102-16 11-78 
Power 104-07 12-44 | 102-58 12-12 
Time-limit ....| 104-34 12-25 | 100-65 13-17 


It can be seen from Table 3 that any significant differences are in favour 
of Manchester gioup. Furthermore, all these differences are in respect of the 
speeded conditions of work (i.e., ‘ speed ° and ‘ time-limit ’). Thirdly, in each 
test, the more highly speeded the conditions the greater or more significant 
the difference in favour of the urban children. This is particularly noticeable 
in the case of arithmetic, but applies in some degree to the other two tests as 
well. 


(b) A comparison of the two sample. on difference scores (e.g., power minus speed). 


The results reported in the Jast section already suggested that the urban 
children were superior to the rural children to a markedly greater degree in 
the speeded than in the unspeeded conditions, although absolute differences 
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i i 7 is depends on a study of 
all. But a crucial test of the main hypothesis dep! : 
cesar differences, i.e., a comparison of the urban and rural children on 

power-score minus speed-score. 


Accordingly, the difference scores were tabulated and analysed, with 
results as shown in Table 4 below. 


TABLE 4 


COMPARISON OF WESTMORLAND AND MANCHESTER CHILDREN ON POWER SCORE Minus 
SPEED SCORE. 


Manchester |Westmorland| Difference Critical Significance 
Ratio Level 
Arithmetic .. x 


: 217 -05 
English sis ‘ 3-30 01 
Verbal 


Reasoning 2-89 “01 


It can be seen from Table 4 above that all three crucial differences, although 
small, are in the expected direction and are statistically significant. The size 
of the difference is largest for English and smallest for arithmetic, although the 
range is not very great, and should in any case not be interpreted too strictly, 
since the standardising procedure in the experiment ensured compatability of 
speed and power scores for any one test, but not necessarily between tests. 


A similar comparison in respect of power scores minus time-limit scores 
is shown below in Table 5 : 


TABLE 5 


COMPARISON OF WESTMORLAND AND MANCHESTER CHILDREN ON Power Score MINUS 
Time-Limit Score. 


Manchester | Westmorland| Difference | Critical Significant 
Ratio Level 


English 
Verbal 
Rehooniig | —0-27 


j] 
| 
Arithmetic .. | —1-87 
zal 
| 


5:30 01 
— 0-80 1-82 n.s. 


| 4:03 -01 


The second-order differences shown in Table 5 are again all in the hypothesised 
direction, two being highly significant, and the third close to statistical sig- 
ay ae Since the time-limit ’ scores are formally similar to ‘ speed ’ scores, 
at mae ona shorter period of testing time, it might have been supposed that 
= s epon usive ren would be obtained than in the comparison between power 
aa a r opera Sr The indication is, however, that even within the short 


8 measurement of speeded work operates 
in favour of urban children. p A oasis 


| 


— e 
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VI.—ConcLusions. 


__ l—Children from a working-class area of central Manchester were co 

with children from a rural area of Westmorland on moderately ah Wei. *t 
arithmetic, English and verbal reasoning as used in an 11+ selection procedure. 
Manchester scores were slightly higher on the average in all three papers, two 
of the three differences being statistically significant. 


i _2.—Scores were obtained after 10 minutes’ work on the forty-five minute 
or fifty-minute tests. Rather larger differences were found in favour of the 
Manchester children. 


3.—When the children were allowed unlimited time to complete the papers, 
the mean differences between the groups were reduced, and none was significant. 


_ 4—Power-minus-speed scores were calculated for each child. Westmorland 
children were slightly but significantly higher on all three papers. 


_ 5.—The hypothesis that speeded tests to some degree handicap rural 
children was considered to be confirmed. Relative differences, however, being 
of the order of one unit of I.Q., were so small as not to suggest any revision of 
the selection procedure. 
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SOME CHARACTERISTICS OF CHILDREN WITH TWO 
TYPES OF SPEECH DISORDER 


By 
T. G. CROOKES 
(St. John’s Hospital, Aylesbury) 
AND 
MARGARET C. L. GREENE 
(Stoke Mandeville Hospital, Aylesbury) 


Summary. An investigation was made of twenty successive cases of develop- 
mental speech defects. They were divided according to the type of speech 
disorder (Greene 1960) into a * Motor ’ group (eight children) and a ‘ Language ’ 
group (twelve children). In the Motor group, but not the Language, there was 
evidence of delay in general motor maturation. The two groups were compared 
on measures of mechanical reading, consistency of laterality and W.I.S.C. 
subtests. Both groups were very poor at reading, but whereas all the Motor 
group had inconsistent laterality, only a third (i.e., a normal proportion) of the 
Language were inconsistent ; on the other hand, the Motor group had a much 


better knowledge of right and left. The Language group, but not the Motor, 
had poor scores on the Arithmetic and Digit Span subtests. Also the Language 
group was significantly worse than the Motor on Block Designs. 

It'was suggested that these might represent two distinct groups of develop- 
mental language defects, and that recognition of the two might help to explain 
some of the contradictions in the literature. 


J.—CLINICAL BACKGROUND. 


Tus is a preliminary attempt to measure by psychological tests certain 
clinically observed features of children suffering from dyslalia and referred 
to the speech therapist between the ages of 3-6 years. It was possible for the 
children to be sorted into two groups on the basis of speech performance alone 
as already described by Greene (1960) and classified as suffering from a ‘ motor ’ 
or ‘ language ’ disorder. The speech of the youngest children at first exhibited 
characteristics of both motor and language disorder. This was to be expected 
since the normal child has the same difficulties in handling articulation and 
language when learning to speak, though to a lesser degree. By the age of 4: 0- 
4:6, the pattern of a predominantly motor disorder on the one hand and a 
language disorder on the other had emerged. Diagnosis in each case was 
supported by a careful case history of general and speech development and the 
tests for motor and language maturation detailed below. 


Motor Disorder Group. 

The children classified as suffering from motor disorder had great difficulty 
with articulation and the accurate imitation of consonants and vowels, 
especially complex consonant blends such as sp, sk, str, etc. The omission and 
substitution of consonants followed a pattern which reduced the conventional 
phonetic sequences of the language to simpler forms, for example ‘ biscuit ’ 
pronounced ‘bitit,’ ‘swim’ as ‘fim.’ Diphthongs were frequently reduced to 
vowels as when ‘ boy ’ was pronounced * bore.’ These mispronunciations were 
s was invariably substituted by ‘t’ and ‘sw’ by ‘f.’ 


consistent as when ‘sk’ A y 
Once the child’s individual code was known, it was possible to understand what 
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id. Vocabulary and language development were normal for age and the 
ae dee ey eles no difficulty. _These children did not have 
difficulty with auditory discrimination and distinguished readily between right 
and wrong pronunciation when spoken by the therapist. Auditory memory was 
normal for age when tested with nonsense syllables and the standard sentences 
from the Terman-Merrill Intelligence Scale. Drawing was good and they had 
no constructional or spatial difficulty. 


The developmental history and clinical testing of motor maturation 
supported the diagnosis of delayed motor maturation. Walking was late. 
The mean age of walking of the group used in the investigation (see Method 
following) being 16-6 months, range 14-23, whereas the mean of the language 
group was 14-4 months, range 10-18. 


Accurate dating of commencement of speech was not possible in every case, 
since the mother, whose report was the sole source of information, was not able 
to determine when pre-linguistic babbling merged into speech ; some mothers in 
both groups were sure their child had never babbled in the first year. 


Clinical tests of motor maturation included closing fist and thumb, thumb 
and finger opposition, and cutting paper with scissors (Merrill Palmer) ; hoppin, 
on one foot (Gesell) and ability to protrude and lift tongue-tip (Morley 1957), 
besides ability to imitate articulatory movements in consonant and vowel 
syllables. All children exhibited signs of late motor maturation and poor co- 
ordination which appeared to stem from a difficulty in isolating movements, 


with delay in the normal synkinetic evolution as described by de Ajuriaguerra 
and Stambak (1955). 


The Language Disorder Group. 


The language disorder group did not suffer from a similar delay in motor 
maturation, they crawled, sat and walked within normal limits and performed 
well on co-ordination tests. They had little or no difficulty in direct imitation 
of consonant blends and words but had considerable difficulty in recalling the 
correct phonetic sequences and also grammatical constructions in spontaneous 
speech. Their speech was characterised by inconsistent substitutions and 
omissions which rendered speech often incomprehensible even to the mother. 
They made unpredictable mistakes in the order of sounds within words and 
words within sentences. Conjugation of verbs was poor and they peformed 
badly on the Watts language picture tests. They had difficulty in remembering 
names of things and people. They could discriminate by ear perfectly when 
pictures of similar sounding articles were named and when the correct pro- 
nunciation was contrasted with the child’s error. They could not determine 
whether a word was right or wrong, if presented with their own erroneous 


version, as in the case of the child who said ‘ umblella’ or ‘ lumbella’ for 
umbrella. 


Auditory memory as tested by repetition of groups of nonsense syllables 
and the standard sentences from the Terman-Merrill Intelligence Scale was 
deficient. Some of the children had a constructional or spatial difficulty and 
failed the Merrill-Palmer tests for copying a cross, a star and the three and six- 
cube pyramids and assembly of the Manikin. Drawing showed an inability to 
appreciate form and orientation. For these reasons it was felt that these 


children had a deficient sense of order in time and space associated with the 
difficulty in retention and recall of auditory material. 


a 
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II.—METHOD. 


The subjects were children with severe speech defects referred for treatment 
to the speech therapist (M.C.L.G.). None of them had any physical malforma- 
tion; clinical and audiometric examinations by the appropriate authorities 
had revealed no neurological or auditory abnormality. Those who fell into one 
of the two categories described above were referred to the Psychologist (T.G.C.), 
who carried out the tests. The speech therapist made the differential diagnosis 
between motor and language before seeing the test results, but the diagnosis 
was not known to the psychologist until the whole series had been tested 
The children had to be at least five years old when the testing was carried out, 
because this is the minimum age for one of the tests used (the W.LS.C)) ; 
also they were not.sent until their speech was at least intelligible to a person 
unfamiliar with them. All appropriate cases were referred for testing until a 
total of 20 was reached (8 motor (7MIT) and 12 language (8M4F)). This took 
nearly two years. In the testing session, the following investigations were 
carried out : 


_ W.I.S.C. The IQ. was calculated from seven subtests: Vocabulary, 
Similarities, Arithmetic, Digit Span, Picture Completion, Block Designs and 
Coding. Any child with a full I.Q. below 85 was excluded, to avoid cases where 
speech disability was due to inadequate general intelligence. The subtests were 
used for the investigation of immediate memory and ordering ability. The 
advantage of having standardized scores was thought to outweigh the dis- 
advantage of using tests which were not specially devised ‘ pure ’ measures of 
the abilities concerned. From the way in which the scores are standardized, 
a child’s Scaled Score on any subtests can be compared directly with his scores 
on other subtests, and also with the scores of other children of different ages. 
Digit Span was used as a measure of immediate memory, Arithmetic and Block 
Designs as measures of ordering ability, Arithmetic, at this level, being a measure 
of the child’s familiarity with the sequence of numbers and the ability to apply 
it to simple addition, Block Designs measuring the ability to copy groups of 
objects in their correct spatial relationships. They were compared with 


Vocabulary and Similarities, which were considered the most free from ‘ order- 


ing.’ The other two subtests were omitted, as not falling definitely into either 
category. It was predicted that the Language group would be relatively poor 
on the Digit Span, Arithmetic and Block Designs, compared with their Vocabul- 
ary and Similarities. 

Reading, etc. In Hallgren’s (1950) large study of reading disability, the 
only factor of those studied which showed a highly significant relationship to 
dyslexia was the presence of speech defects. The reading of our cases was 
examined to see if it was poor, and to what extent. No hypothesis was made 
about the difference between the two groups. A mechanical reading test was 
used—Schonell’s Graded Word Reading Test ; this was thought preferable 
to a comprehension test, as being less dependent on intelligence. The reading 
age obtained from this was converted to a quotient in the usual way, so that it 
could be compared approximately with the I.Q. As it was not thought appro- 
priate to test a child’s reading before the age of 6, any child who was not yet 6 
on the first testing was seen later, just for the reading, when he was at least 6. 


Each child was asked to write his name and two simple sentences, ‘abig dog’ 
and ‘I like apples,’ chosen to bring out reversals or transpositions of letters 
(what Orton (1937) calls ‘static ’ and ‘kinetic’ reversals) since these are 
important to theories which link speech and reading disabilities with peculiarities 
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ity. is was scored simply for the presence or absence of reversals of 

ee aj ee were dictated to children who could not spell words. 
Laterality. The history of the study of the relationship between speech and 
reading difficulties and various types of laterality (left handedness, mixed or 
crossed laterality, etc.), is a long and contradictory one, well summarized by 
Vernon (1957). No prediction was made as to differences in laterality between 
the two groups. The assessment was made by a simple group of observations 
of the ‘ choice’ type: hand used for writing, hand for throwing (five trials), 
hand for catching, when instructed to use one hand (three trials), eye for 


counting fingers looking through a small hole in a piece of paper held in two 
hands (three trials, in three directions) 


, eye used in naming pictures through a 
‘ V-scope ’ (Miles’ ABC Vision Test) (five trials) 


, and foot for kicking (three 
trials). Clark (1957) found kicking the best and most consistent test of footed- 


ness; a matchbox was used instead of a ball to facilitate central placement. 
There are twent: 


y items in all, and they were scored just as ‘ consistent’ or 
‘inconsistent.’ The criterion for ‘ consistent ’ w. 


as nineteen or twenty items the 
same side, unless the one different was the hand used for writing ; anything less 
than nineteen was‘ inconsistent,’ though as it turned out the highest score in the 
‘inconsistent ’ group was seventeen, and the ‘ consistent ’ all scored twenty. 
The children were also asked to indicate their left foot, right ear, etc. 
(five trials) and to perform crossed movements (e.g., “ touch your right ear with 
your left hand ”) (three trials). These 


D i two sections were simply scored as 
“correct ,’ if all trials were correct, or ‘ incorrect.’ 


T1.—Resurts. 
Intelligence and Analysis of Subiests. 


TABLE 1 


INTELLIGENCE AND SUBTEST SCORES oF THE Two Groups. 


Language 
(N=12) 


Range S.D. 


5:3-11-1 
105-131 


Range 


1:33 |5-1-9-8 
10-50 88-116 


Vocabulary... 2 i 10-14 
Similarities ‘ 


i $ p 8-15 
Digit Span .. ` : : ; 
Arithmetic .. |. 
Block Designs 


significantly better on some an Pela £0. thatthe 
ritical subt 
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i ces, we have an estimate of their relative efficiency on 
e a cee with the neutral ones, I.Q. being held constant. 
Also, from the way in which the scaled scores are derived, we know that random 
groups of the population would be expected to have a mean difference of nought 
when any two subtests are compared ; so by examining the approximation of 
our group mean differences to nought, we can estimate whether th 2 
the general population in their relative skill on these tests, LQ. ag 
constant. Table 2 shows the result of t} 
It will be seen from Table 2 that the m 
deviate from nought in the predicted direction, but onl 


es are in the expected 

If a one-tail test is 
used (as is reasonable since the results were predicted) all three are significant 
compared with the general population 


8 quotients of the two groups 
One child was lost from each group ; these both did 
the original testing at the age of 5, and could not be 


obtained for a second 
assessment after their sixth birthday. Neither of these 


could read any words, 
and they only named about half of the letters in the first line of the test correctly 


À tor group and five out of 
eleven of the Language had readin; i i 

Comparison of the (1.Q.—R.O 
6. 


TABLE 3 


MECHANICAL RE 


ADING QUOTIENTS. 


Motor Group 


Language Grou 
(N=7) (N=11 j 
GPE ies essmematit.. conan: | M=7-44 | M=7-64 
| Range=6-3—1]-] | Range =6-1—9.g 
Ee eee ae M=115-00 | M=99-82 
Range=105—131 | Range=88—115 
| 
ROE ee | M=86-14 | M=73-64 
ange =72—107 | Range=61—87 
o M= + 28-86 | M= +26-18 
Range= + 13—+ 38 | Range= +8— +45 


T erality tests, 
5 1 - the probability of getting 
s this, by direct calculation (two-tailed) is +008. There 

own by the Motor Group. 
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Putting the predominant choice for each member i 

j T in the order 

ae = three RLR, two LLR, one RRL, one RLL and 1 TRL. Bone at thon 

u aa inconsistencies within one member. In the Language Group there 

lah RRR, one RRL, two RLR and one LRR. From the data given by 

mn r ( 949) the proportion of inconsistent in the Language Group is about 
at would be expected in an unselected sample of children of this age group. 


. From Table 1, it can be seen that the Motor Group are older an 
intelligent than the Language Group, so this lack of establishment of a 
cannot be ascribed to lesser general maturity. 
_ The ability to name one’s sides is probably an all-or-none ability. Accord- 
ing to Launay (1952a), nearly all dyslexic children can do it by the age of 8 
and nearly all normal children by the age of 7. Thus, it is unlikely to be dis- 
pomnieieag after the age of 8, and we restricted this comparison to children 
ae the ages of 5 years 0 months and 7 years 11 months. This left seven of 
ae Motor and nine of the Language. Five Motor and only two Language were 
ferent in simple indication of sides, and all seven were of inconsistent laterality. 
he nine who made mistakes were three inconsistent and six consistent. The 
Probability of drawing all seven inconsistent from a group of ten inconsistent 
and six consistent (two-tailed) is -021. Unfortunately, knowledge of sides in 
this age-group is clearly related to intelligence. The Mean I.Q. of the ‘ correct á 
group is 112-3, of the ‘ incorrect ’ group 98:8. However, there is certainly no 
evidence of a relationship between inconsistency of laterality and lack of 
knowledge of sides, and in any case the ‘ incorrect * group is mostly drawn from 
the Language group whose IQ. is thought to be lowered by their special dis- 
abilities. There were four 7-year-olds in the Language group, and they all 
made mistakes ; so they are probably worse at this than normal; the three 
7-year-olds in the motor group were all correct. Only three of the whole series 
were able to carry out crossed movements, two motor and one language, all 


three of iaconsistent laterality. 

_ In considering the question of reversals of writing, the second test of the 
original 5-year-olds was used. As two were missed, this gives six Motor and 
eight Language between the ages of 6 years 0 months and 7 years 11 months. 
Four of the Motor group and six of the Language showed reversals, five out of 
six with consistent laterality and five out of eight inconsistent. There is no 
relationship here between reversals and type of speech disorder or type of 
laterality. The degree of reversal varied greatly from just one error (‘b’ for 

d’), to a complete confusion of all letters. This clearly needs a more syste- 
matic method of study. 


IV.—DIscussIon. 


The prediction of the relatively poor performance of the Language group 
on Digit Span, Arithmetic and Block Designs was borne out in the case of 
Digit Span and Arithmetic. The Motor group were significantly better than 
the Language group on the Block Designs, but this was due to the fact that the 
Motor group were better on these than on other tests, not that the Language 
group were worse. Digit Span, the most discriminating test, is a test of 

immediate auditory memory,’ but in memory tests it is always advisable to 
ask ‘memory for what ?’, and in this case it is memory for numbers 1% order A 
mistakes are often mistakes of order, the correct digits being given but in the 
wrong sequence. So this can, perhaps, also be taken as evidence for the poor 
Ordering ability of the Language group. 
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i i ing is the difference between the two 

ies ae Reni ne meee group being all inconsistent, the Language 

Sre A a ‘normal’ number of children with Inconsistent laterality. 

his seems to provide a further definite basis for distinguishing the two types, 

in addition to the speech differences. It may offer a lead to the solution of the 
problem of the contradictory findings about laterality in reading disabilities. 


groups in 


the following 
type, poor reading, in- 
consistent laterality, no deficiency of immediate audit 


sy. In absolute terms, Group 2 
are considerably worse readers than Group 1. Not one of them reaches a 
and I.Q., but it 


-Q. is partly composed of tests 
in which Group 2 are thought to do badly becau: 


ieir deficiency 
ated in a completely satisfactory 
y the clinically observed spatial 
guage Group, although t, significantly 
fact that it is a 


more discriminating ; de Hirsch (1957) 
an test to be u v KItis hoped to 
T with more refined procedures. 

Much of the stud guage defects in chi 
concerned with 5S0 three things : (a) the actual disability in 
writing ; (b) peculiarities of later: > (c) defects in spatial 

e a priori reasons for linking (b) and (c) but 
empt to do so which acco 


Idren has been 


confusion ; 
and (c), with no co, 


The linkage of (a), (b) and c) is 
Ge Gea Senta the notion of a ‘ dominant hemisphere l np mee of 
and also in (a) ingen eme completely domina: z 


> 2 ; ld be though 
lack of dominance, in the (ercon 8: 
consequence of (b), which N by Orton 
a consistent sense of directi ed ae 
umphrey and Zangwill (1952), Penfield and Roberts (1959) Zangwill (1960) 
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Another source of confusion in this subject i: 
reading and writing. 
such reversals are re. 
going to make a mistake in writing, it is 
that it can be interpreted as a reversal o 
is drawn to this kind of error to the exclusion of other kinds. This point is 
clearly brought out by Miles (1961), in his discussion of the mistakes made by 
two children with ‘ developmental aphasia.’ It is clear from his analysis that 
the disorientation which he considers to b 

is i i that 


highly complex thing, involving space and time. 

his two cases of severe disability were both of consistent laterality. It is well 

known that nearly all children show reversals at some stage of learning to read 
It might be that some reversals are just mistakes, 


and write (c.f., Launay, 1952). 
while others are symptomatic of a genuine disorder of orientation. 

It is recognised that much of this discussion is speculative and goes beyond 
the facts. However, we feel that we have distinguished two types of develop- 
mental language disorder and that investigation with the two types in mind 
may help to clarify some of the issues. 
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AN ANALYSIS OF C. W. VALENTINE’S REASONING TEST 
FOR HIGHER LEVELS OF INTELLIGENCE 


By H. J. HALLWORTH 
(Department of Education, University of Birmingham) 


SUMMARY. Answers to C. W. Valentine’s reasoning test for higher levels of 
intelligence made by 286 men and 262 women were analysed and compared. 
Analysis of the item inter-correlations by the method of principal components 
gave a factor of general reasoning ability for both men and women. Upon 
rotation to the Varimax and Quartimax criteria three group factors were 
obtained which were comparable for the two groups of subjects. These factors 
are described and tentatively named. The results indicate the potential value 
of a detailed examination of the thinking processes involved in answering the 
items of this test. 


I.—INTRODUCTION. 


Upon. the publication of his article on “ The use of a new reasoning test for 
selection of university and training college students ” (Valentine, 1961), Professor 
Valentine kindly made available for further analysis the test scripts of 548 
university and training college students and sixth form pupils. This article 


reports the results of the analysis. 


II.—METHOD OF ANALYSIS. 


d into two groups, 286 completed by men, 262 by 


The scripts were divide 
o similar analyses were carried out, 


women, Using an electronic computer, tw 


one for each group. 

The test gives a score for each of sixteen different items. Including total 
test score and age, there were, therefore, scores on eighteen variables for each 
subject. Means, standard deviations and product-moment inter-correlations 
were obtained for all variables. The correlation matrix for the sixteen test 
items was analysed by the method of principal components, and the factor 
loadings were rotated to the Varimax (Kaiser, 1958) and Quartimax (Neuhaus 


and Wrigley, 1954) criteria.* 


IIJ.—MEANS AND STANDARD DEVIATIONS. 

The mean age of the men was 22 years 3 months, and of the women 19 years 

4 months. This difference was due to the fact that, at the time of testing, 

national service delayed the college entry of most men. The mean test score for 

men was 28-9, with a standard deviation of 14:5; the mean score for women 

was 24:1, with a standard deviation of 12-7. These figures are consistent with 
those given in the original article. 7 

On fourteen of the test items men had a higher m S 

For two items women had a higher mean score, but in neither case was the 

difference between men’s and women’s means significant. It is, however, of 

interest that these two items are different from all others in that they each 


combine four characteristics : 
(1) Each specifies one attribute of a logica 


Dite. 
a Ferranti Mercury computer and were written by the 


* All programmes used are for 
author in Pis ‘Autocode and machine code. (Hallworth, 1962.) 


mean score than women. 


1 class, using the words “ None 
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(2) Each specifies another attrib 
wording “All...” 


(3) Neither of the two attributes is exclusive to the class. 


(4) The answer demands an appreciation of 
defined by each of the two attributes are 
Precise information regarding 

is not often available. This fin 

information appear from another 

it would surely demand further i 


As would be expected, the 
women for most items. 


ute of the same logical class, using in effect the 


the fact that the logical classes 
not necessarily co-incident. 

adult sex differences in patt: 
ding is recorded because, sh 
sample using this test, or fro 
nvestigation of the thinking 
men had a higher st 


erns of reasoning 
ould comparable 
m another source, 
processes involved. 


andard deviation than the 


IV. 


—CORRELATIONS WITH TOTAL SCORE. 
Correlations of to 


tal score with age 


a were insignificant for both men and 
women, being —-05 and + 09, respectivel 

was probably due to 
had recently gained 


| 9, re y. The negative correlation for men 
the inclusion in the sample of a numb: 


inclu 0 er of sixth formers who 
university open scholarships. 


TABLE 1 


Item 1 


z{s[alslels 


(a) A General Factor, 


From the Correlation i 
e mat: ix i 
gróups of subjects, eight Principal eon teen n ae ae 


were extracted. These accounted. 


8 | ofi 11 12 | 13 14 | 

| r L i 

CET 47 | -47 | -40 |50 | -35 | -43 | 41 ‘39 | 52 54 | -53 | -58 | ‘53 | -41 | -32 | -31 
: [| [2) a9] art a0 “46 | 43) 43 48) 53 50 35) 30] 13 


a 
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a a total of 69 per cent. of the variance for the men’s group and 67 per cent. for 
he women’s group. In each case the first component could be regarded as a 
general factor, accounting for 22 per cent. of the variance for the men’s group 
and 18 per cent. for the women’s group. Item loadings on this component are 


given in Table 2. 


TABLE 2 


ITEM LOADINGS ON First PRINCIPAL COMPONENT. 


3|4 5 | 6 7[8 o fio 11 | 12 13 | 14 15 | 16 


Men eass cu -49 | -42 -39 | -50 | -28 | -51 | -31 | -30 | -69 | -58 | -49 | -70 -69 | 31 | -22 | 20 
Women .... JRE -29 | -15 ogg -40 | -69 | -67 | -27 -18-06 


his kind is, of course, to be expected from this method 


A first component of t 
ysis be accepted as it stands, or 


of analysis. The question is, should the anal 
should the axes be rotated ? Since there are always many possible sets of factors 


the decision to accept one set rather than any other must be made on psycho- 
logical grounds. In the present case, acceptance of these components without 
rotation would give the general factor which has been postulated, and in this 
sense, the prediction is verified. In popular terms, this could perhaps be regarded 
as a typical British solution to the problem, extracting as large a common 


factor as possible. 


However, the remaining components 
groups, nor did they make very good psychological sense in either case. It was, 
therefore, decided to rotate the axes in an attempt to discover another set of 


factors, more of which would have good psychological meaning. 


were not comparable across the two 


(b) Group Factors. 

For both groups, all eight components 
criterion. This is claimed by Kaiser (1958) to give result 
Thurstone’s ‘simple structure,’ therefore, we may expec 
American solution. The results are given in Table 3. 
Since the method of rotation tends to spread loadings more evenly between a. 
number of group factors, the general factor has disappeared, and there are now 
three factors which are fairly comparable for both groups of subjects. 


Varimax Factor 1 has its heaviest loadings on items 9, 12 and 13. In order 
te that in section two 


i is fa rchologically, we must first no 0 
to interpret this factor psy chologically t fist note 1061), two kinds of 


of t there are, as explained b Professor 
ee 4 D perceive a correct argument ; secondly, 


item: first, items requirin; ability dly 
items Re abby to P ave an incorrect argument and know why it is 
incorrect. Items 6, 9, 12 and 13 are the only items of the first type. It would 
probably be agreed that they can be answered more intuitively ’ than can the 
other times. The relatively low loading of item 6 may be accounted for by the 
greater complexity of argument in this item. Similarly, the relatively high 
loadings of items 10 and 11 may be accounted for by the brief wording of the 
argument and relative ease of answenng. If so, the best general description 
of this factor is ‘ intuitive reasoning - $ 


were rotated to the Varimax 
s approximating to 
t a more typically 
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TABLE 3 


VaRIMAX Factors FoR SIXTEEN TEST ITEMS. 


-4, inclusive, 
asoning.’ This factor was predicted. For 
e most comparable factor is i 


? however, a low lo 


8S on items 15 and 16 for both 
ems, the last of the test, are of a different order 
ets and yield a Telatively low mean score for both 
men and women, Both Mvolve relationships between five categories, and the 
answers involye double i 


egation, Although Factor III y 
may be Interpreted as Telating to ability to d 
problems, 3 
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(c) Further Group Factors. 

The Quartimax method of rotation has commonly been used for cognitive 
data on the assumption that it will bring out any common factor (Harman, 
1960, p. 301). It was, therefore, used in the present instance in an attempt to 
obtain a general factor accompanied by other factors with psychological 
meaning. In Table 4 are listed the principal Quartimax factors for both 
groups of subjects. No general factor was obtained. Comparison with the 
Varimax factors shows a remarkable similarity. For both men and women, 
the sets of Varimax and Quartimax factors are almost precisely the same, 
differing by little more than the limits of precision written into the computer 


programmes. 
TABLE 4 


QUARTIMAX Factors FOR SIXTEEN TEST ITEMs. 


WoMEN 
1 
68 
65 
56 
+35 


Factor 


Item 


14 


14 


15 


16 


Square Sum.... 


VI.—CoNncCLUSION. 
i i than one set of factors 
i ith factorial analyses, there is more thar 
Hide oe ee meaning. Accepting the principal a 3 
they stand thee is a general factor of reasoning ability accounting for app: 
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-i of all the variance. Rotating to either the Varimax or Quarti- 
A Gees are three factors which are related to features of the test 
items and to the reasoning abilities which they apparently demand. These 
factors, and certain differences between Scores for men and for women, suggest 
that it would be valuable to have a detailed examination, possibly along the 
lines used by Duncker (1945) and Peel (1962), of the thinking processes involved 
in answering the items of this test. 
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THE DEVELOPMENT OF CHILDREN’S IDEAS ABOUT 
COUNTRY AND NATIONALITY 


Part I: THE CONCEPTUAL FRAMEWORK. 


By GUSTAV JAHODA 
(University of Glasgow) 


Summary. A random sample of children aged 6-11 in four Glasgow schools 
was interviewed and tested, two situated in middle-class and two in working- 
class districts. The formation of concepts relating the series of units Glasgow- 
Scotland—Britain is traced in terms of both verbal formulation and spatial 
representation. The difficulty for younger children of such concepts as ‘ town’ 
and ‘ country ’ is demonstrated. The emergence of national identification is 
described, indicating the major sources of initial confusion. 


I.— INTRODUCTION. 


Tuis study was originally inspired by Piaget (1928, 1951), who attempted to 
establish developmental stages for children’s concepts of country and nationality. 
Unlike most other replications concerned with more clear-cut spheres of thought, 
such as the work of Lovell (1961) on mathematical and scientific concepts, the 
findings were in several fundamental aspects at variance with those of Piaget. 
The theoretical problems raised thereby will be dealt with separately. Here the 
aim is a two-fold one: to analyse the gradual emergence of concepts and 
relationships, and to offer tentative developmental norms for children of 


contrasting socio-economic background. 


II.—METHODS. 

(1) The Sample. The first stage of sampling was a ‘ purposive ’ one: 
two primary schools were selected from well-to-do middle class areas, where 
parental occupations are mainly clerical, professional and administrative ; 
in contrast, two other schools were located in a high-density and now much 
dilapidated working class district, where semi- and unskilled occupations 
predominate. 

Within the schools, a sampling fr 
pupils being divided according to age an V m. 
minimum age was fixed at six, because it had been obvious from preliminary 
trials that it would have been 
procedure to the youngest children ; 3 y 
essential if valid aaran were to be made. After excluding any children 
not of Scottish parentage, selection was done at each agı y i 
table of random numbers. three boys and three girls always forming the unit. 
This meant that the sample was widely scattered, only a small number being 

to minimize the possibility of 


taken from any one form. The object o 0 ze the p 
Pert among the participants, and there is every indication that this 
was satisfactorily achieved. The general plan is, perhaps, easier to follow when 


set out schematically : Ti w 
i District (M, orking Class District 
Middle Class Dis (M) Peoria Hake ch 


School A School B ( ) 
B irls Boys Girls Boys Girls Boys Girls 
os (ces > 6-7- 8- 9- 10- 11-) 
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There were thus : eee 
2 (social classes) x 2 (schools) x 2 (sexes) x6 (age levels) x3 (individuals) = 144. 


Unless otherwise specified, all subsequent tables will be based on this total 
of 144 children. Although sex had to be one of the controlled variables, 
significant sex differences failed to emerge. Hence results will not be shown 
separately for boys and girls. This does not necessarily mean that there are no 
contrasts between the sexes in this field ; in fact, some of the differences were at 
least suggestive. However, in view of the relatively small numbers, dictated by 
the time-consuming nature of the interviews, it was felt that one would not be 
justified in putting much stresss on this aspect. 


(2) Interviewing and Testing. The children were seen individually. Each 
session lasted between 35 and 70 minutes, the mean being about 45 minutes. 
In order to arouse and maintain 


the interest even of the younger children, the 
procedure was devised so as to present considerable variety. 


In an investigation of this kind, the handling of the situation is of para- 
mount importance. The procedure was not started until the children had been 
put at ease by some preliminary chatting, and, in fact, an atmosphere of 


The schedule itself was indended as a 
guide, not a straight-jacket. i 


suggestions. 
and the range 


, III.—DEVELOPMENTAL FRAMEWORK. 
(i) Geographical Stages. 


ally extends outwards. 
» With the child at th 


may be called the ha to be introduced here is what 


with some diffidence 
-stage ’ for short. This is 


G. JAHODA 49 


The measure was arrived at after attempting, without success 7 
Piaget's scheme. The one adopted here Eene the use of what Rue 
anea natural ’ units for the purpose of categorization. W hilst not eliminating 
the element of arbitrariness entailed in the slicing up and pigeon-holing of what 
is essentially a continuum, it does at least tend to reduce it. There were, of 
course, a number of marginal cases, but the large majority could be easily and 
unambiguously classified. The categories, or G-stages, were as follows : 


| 
| 
' G.I. : No conception of Glasgow as a unitary whole. 
ý G.II : Conception of Glasgow as unitary whole, but no conception of it as 
_ __ part of Scotland. 
` G.IIL: Conception of Glasgow as part of Scotland, but no conception of 
: ___ Scotland as part of Britain. 

G.IV: Glasgow-Scotland-Britain relationship correctly expressed. 
t 


The ideas of children at these four stages will now be illustrated with 
examples. The questions by the interviewer are shown in brackets ; the 
symbols after each quotation stand for type of school and age. 


G.I. 
ristic of children at that stage that they seemed to think of 


It was characte: 
though not usually including their 


Glasgow as some kind of vague entity near-by, 
own immediate geographical location. 


(Where is Glasgow ?) 
Beside the playground, in X street. (W6) 
Its up by the park there—you go round the corner. (W6) 
Down in Y avenue. (M7) 
‘if I don’t know the way to Glasgow. (W7) 
were familiar with the word ‘ Scotland,’ their notions 
of this concept were apt to be nebulous in the extreme. Several main tendencies 
are distinguishable : (e) the fusion of Scotland with Glasgow ; (b) the belief 
that Scotland is somewhere outside Glasgow ; (c) random associations called up 
by the word ; and also combinations of these.. 

(What is Scotland ? Where is Scotland ?) i . 
I get muddled up—is that not Glasgow? Yes, it’s just Glasgow. (W6) 
My granny stays in Scotland. (Do you go and see her?) Yes, I go up at 


Although nearly all 


i 3 o'clock every day. (W7) 
I was born in it—it’s the name of the town. 
It’s just outside Glasgow. (ws) a 
Scotland is the capital of Edinburgh. It’s in Glasgow. (M7) 
{ Scotland is a street. (Yes, Scotland Street, but have you heard of a place 
called Scotland?) Yes, Scotland the Brave, it’s up 1m the Highlands. 
WT , : 
M The EM ences the telly—I saw it last night. (Where is Scotland?) 
i Away far. (W6 


The majority could say nothing about Britain at all, beyond perhaps the 
fact that they thought they had heard of it. If it was conceived as a place at 
all, it was clearly a very remote one. 

i t heard of Britain ?) 


'i (What is Britain? Have yor ee 
I've heard the word, but I don’t know what it’s like. (W 6) 3 
» Yes. (What do you think it is?) Our next-door neighbour went to it and 
T (M7) 


' brought some dolls home. 


a 
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It was there that the men fought the war. (W8) 


In general, it may be said that at this first stage the organization of 


geographical concepts is minimal, responses being mainly in the nature of 
diffuse and undirected associations. 


GII. 


The chief difference between this stage and the preceding one is that the 
children’s answers showed an awareness of being within the unit called Glasgow, 
which was no longer thought to be located somewhere else. Responses to the 
question “ Where is Glasgow? ” were fairly stereotyped, usually taking the 
form “ It’s here, where we are.” There were, moreover, a few replies which may 
be regarded as transitional to the subsequent stage, though they undoubtedly 
belong to G.II because of their inherent contradictions. 

Glasgow is in Scotland. (Where is Scotland ?) 

Scotland is a country outside Glasgow. (W7 

Glasgow is in the middle of Scotland. (What is Scotland ?) 

where the people play the bagpipes. 
lands. (M7) 

The remainders’ ideas about Scotland were largely similar to those of G.I, 
including the notions that Scotland is ‘in England’ or ‘ part of America’ 
(both M6). 


A higher proportion ventured some statement about Britain, but 
the level of accuracy was usually not much higher. 


(What is Britain?) 

It’s a city in England. (W7) 

It’s a big town: mountains are there, and they climb on it. (W7) 

I don’t think I’ve been there. (M6) 

It’s a city in Scotland. (M9) 

The major advance, therefore, is confined to the consolidation of the con 
cept of Glasgow as an entity in which one lives and which constitutes one’s 
surroundings, 


It’s a place 
(Where?) Away in the High- 


G.I, 


Glasgow is in Scotland.” or words to that 

effect. Scrutiny of the total response patterns suggests, however, that one 

v ) h into this, It is true that re lies did not 

go astray until after the question “ Where is Scotland ? ”, but mer shortly 
probably tends to be somewhat shaky. 


‘sup north. Britain 
land surrounded by 
: “continent ma It’s in the world. Britain is really the whole 
CO is : , i 

cane Gare It's beside Wales and Ireland. Britain is mostly 
2 MS mtan; it’s soit of Scotland—another name for it. 
Scotland is a countr . Britain i i 
x Long pause) Of [oan $ ne United, State. (What do vou mean ? 

nd is a country, (Where ?) Can't s iaia 
a rez say. Britain is 4 f different 
countries, (What Countries ?) lagna onion Ieee wi) 


m 
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È When one considers these and similar answers the phrases “ Glasgow is in 
Scotland ” and “ Scotland is a country ” begin to have a suspicious air of an 
empty formula. Additional evidence shows that this is not always justified, yet 
in the main one seems to be faced with the parrot recital of terms learnt without 
adequate understanding. In other words, grasp of relationships has failed to 
keep pace with the acquisition of a new vocabulary. It is thus, perhaps, 
doubtful whether G.III should really always be accepted as a genuine stage 
regarding concept formation ; it might be safer to look on it as a temporary 
transition between G.I] and G.IV. 


GIV: 

This can be dealt with very briefly. In order to qualify, a child had to 
convey some understanding of Britain as well-defined unit. The first of the 
quotations below represents roughly the acceptable minimum. 

Glasgow is in Scotland. Scotland is a country in Great Britain. Britain 

-İs some countries joined together. (M6) 

Glasgow is in the Central Lowlands of Scotland. Scotland is a country. 

It’s just north of England. Britain is Scotland, England, Wales and 
Northern Ireland. (W11) 

With G.IV children have achieved the norm: 

The levels reached by various sub-groups are summarized in Table 1. In 
this and most subsequent tables results are presented separately for three age- 
ranges, comprising twenty-four children each ; within age-ranges there is a 
further sub-division into Mf and W schools. This produces a general picture of 
developmental trends as influenced by age and socio-economic background. 


al adult standard in Britain. 


TABLE 1 


GEOGRAPHICAL STAGES REACHED By TYPE OF SCHOOL AND AGE. 


W Schools 


M Schools 


Geographical 


Stages 8-9 


G 

G. 
G. 
G. 


There is a highly significant difference (P less than -001) between the 
distribution of stages in M and W schools. Jt may also be noted that G.I 
contains a smaller total than the stages preceding and following it, thus affording 
some support to the view that it might well be a relatively brief transition stage. 


(ii) Spatial Representation of Geographical Relationships. ip: 

The procedure used in determining G-stages is open to the objection that it is 
conducted entirely at the verbal level, and in fact, some doubts have been 
expressed on this score as regards cM. A special test dealing with the appro- 
Priate spatial concepts was, therefore, devised. It was administered in two parts 
the first being a preliminary training session designed to familiarize the children 
ìn a simple manner with the symbolic relations involved. 
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FIG. 1 
MATERIALS USED IN THE SPACE TEST. 


BLACK (choo! /Bri tain) 


WHITE (Not used/Scotland) 


BRASS @ 6 ric<:/ctos9 


WHITE (Room/England) 
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The material employed, seen in Fig. 1, consisted of one black and tw i 
plastic boards, and a brass disc. For the try-out the smaller of the aes ieee 
van ett out, and the child asked to reproduce the immediate situation, i.e. 
= f-in-room-in-School i this, is of course, analogous to Glasgow-in-Scotland-in- 
aie Any child failing in this preliminary task was given a detailed explana- 
ee and demonstration with the aid of the materials. This was necessary with 
‘5 per cent. of all M children and 37-5 per cent. of W children (P less than 


001). 


_ The decisive test came later, immediately following a musical test, so as to 
minimize carry-over from the questioning. This time the child was given all 
four pieces, with the information that the largest one was supposed to be 
Britain. The task was (a) to decide which of the others represented Glasgow, 
Scotland, and England, respectively, and (b) to assemble them correctly. The 
criterion for failure under (a) was simply not choosing the small brass disc for 
Glasgow ; only a child making this mistake, whatever the subsequent perform- 


ance, was classed as S.I. 


_ In order to qualify for success in positioning, three conditions had to be 
satisfied : brass disc on top of one of the white pieces ; white pieces side by 
thin the area of the black one. 


side, not on top of one another ; white pieces wi 
vas again classed as failure, S.I. The most 


Anything not conforming to this v 

common failures are illustrated in Fig. 2. Among the youngest children, side- 
by-side placement of all the pieces often happened (S.I.—A) ; overlapping 
sometimes occurred ($.I.—B), or simple building up (S.I.—C), which was an 
attempt to reproduce the earlier practice trial whilst coping with the additional 


piece. The most frequent failure of all was something like S.I.—D, namely, the 
brass disc flanked by the two white pieces ; from the conceptual point of view, 
this suggests that such children thought of Glasgow and Scotland as two separate 


units, without as yet any notion of spatial inclusion. 


All children fulfilling the above conditions were scored as successes (S.I), 
ignoring both identification of white pieces and compass orientation. However, 
a further minor distinction was made between children who had at least one of 
these wrong (S.II,), and those who were right in both (S.IJ,). In the example of a 


successful solution shown in Fig. 2, the location of the Glasgow disc within 
but the exact positioning was not 


Scotland was obviously carefully chosen ; t 
considered in the scoring, as it would have been difficult to know where to draw 
the line. 

Results are set out in Table 2, and it appears that the contrast between M 
and W is as sharp as it was in the case of G-stages, the overall difference being 
highly significant (P less than -001). Closer scrutiny reveals a distinct age 
pattern: at 6-7 there is little to choose between M and W; but whilst M 
children progressed steadily from then onwards, W ones exhibited no appreci- 
able advance until after the age of 9, and then they were still far from having 
caught up. 

An i ay be made concerning the relative frequencies of 
S., and Sil, Da etal the former were less stable. In other words, 
Once the basic nature of the Glasgow-Scotland relationship is grasped, compass 
orientation and size soon come to be sorted out as well. 
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FIG. 2 
TYPES OF FaILtuRE AND SUCCESS IN THE SPACE Tesr. 


Sp. I-A Sp.I-B 


Sp. I-C 
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TABLE 2 


SPATI 
AL STAGE REACHED BY TYPE OF SCHOOL AND AGE. 


M Schools W Schools 


Spatial Sta 
si 8-9 
= 10-11 8-9 


2 


(iii) ve of the Two Approaches. 
The bs: pe tl Sa in Table 3, which shows a regular and orderly progression. 
ie percents jip “| most clearly appaient from the bottom row, which gives 
Arse aE > 2 complete failure on the space test for each G.-stage. The 

a reduction in the failure rate occurs at G.III, and there is a 


Spectacular drop between G.III and G.IV. 


TABLE 3 


RELATIONSHIP BETWEEN GEOGRAPHICAL AND SPATIAL STAGES. 


S-Stages ae 
G.I G.II G.I G.IV 
Le E 
25 35 21 6 87 
1 1 5 11 18 
= 1 3 35 39 
ee eS E (aes 
26 37 29 52 = 


Per cent, S.I. for each 
G-stage oersette 


n of the interpretation 
e G.III is clearly one of transition, in the course of 
for « Glasgow is in Scotland ” remains largely an empty 
ie devoid of an adequate conceptual appreciation of the part-whole 
giv ionship involved. On the other hand, a large majority of those able to 
en expression to the broader Glasgow-Scotland-Britain pattern have, 1n fact, 


achieved such a grasp. 


wid Before leaving this important si 
iron, variations in ages at which a 
of 10 Table 1 and 2. It is noteworthy, more í ! r 
and 11 have reached an adequate concept of spatial relations, a fact which 


t 
ends to be concealed by the effects of rote learning. 


alr The comparison offers a satisfactory confirmatio; 
cady put forward. Stag 


which the verbal statement * 


ection, attention should also be drawn to the 
particular stage js attained, which emerges 
over, that only half the W children 


connexion with some of the quotations that 
tries such as Scotland and England as 


(iv) The Concept of ‘ Large Town.’ 


child may have been observed in 
ren were apt to describe coun 
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‘towns.’ The converse error was also made occasionally, though there were 
fewer opportunities for this; a confusion of the kind is certainly implicit in 
such statements as “ Scotland is the capital of Edinburgh. 


Immediately following the topic of Glasgow, children were asked to name 
some other ‘ really big towns.’ This was an attempt to discover the range of 
notions conveyed by the phrase at different ages. In evaluating the findings, 
summarized in Table 4, one limitation of this approach must be kept in mind : 
the total number of responses given by eacl#child was not controlled, and this is 
a possible source of variation. However, as the general pattern seems to make 
good sense, it cannot have been of great importance. 


Among the youngest children, nearly one-third were unable to given any 
answer, or mentioned street names ; the remainder of this group produced a 
large and ill-assorted variety of responses, only two of them entirely correct. 
In the intermediate age-group, most of the crudest errors had dropped out, 
leaving mainly a confusion between small towns, large towns and countries. 
Finally, at the ages 10-11, the chief remaining mix-up was between small and 


large towns. As before, M children were generally greatly in advance over W 
ones (P less than -001). 


TABLE 4 


Types oF RESPONSES FOLLOWING REQUEST TO NAME ‘ REALLY Bic Towns.’ 


W Schools 


8-9 | 10-11 


Large towns only* 
Large and small towns 


Countries, large towns, and 
small towns 

Countries, small towns and 
streets 

Streets only 

Unable to name anything .... 


On the basis of these findings, one may hazard some inferences about elem- 
ents of the thought processes involved. With the least mature group, the 
question evokes just any kind of geographical names that happen to be 
familiar to the child ; in other words, the concept of large towns is so diffuse 
that it is taken to cover a variety of units, from streets to countries. If these 
concepts have not yet become differentiated, one can hardly expect the child to 
establish any kind of relationship between them, as was the task with regard 
to Glasgow, Scotland and Britain. 

Gradually some boundaries begin to take shape, and the child moves into 
a zone of transition. Whilst the difference between towns and countries is not 


fully understood, some children are able to d t least riate 
verbal formula to designate the procu ab Peast An approp 


he relationship (e.g., “ Glasgow is in Scotland ”’) 
others, though fewer, are beginning to apprehend the spatial relations. 
* A ‘large’ town was defined as having a minimum population of 150,000. 


N A 
a 
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The final stage, in terms of tl i i i 

se, in term he classification adopted here, is the achieve- 
pee eee diverts of units. This permits both an insightful verbal 

ula , as opposed to parrot recital a i 

spatial A p ecital, and correct abstract representation of 
am such an interpretation is valid, it follows that the notion of what con- 
Sa csa really big town’ ought to be closely associated with both G and S 
stages. Table 5 indicates that this is, in fact, the case (P less than -001). 


TABLE 5 
TION TO G AND S STAGES. 


Types or ‘ REALLY BiG Towns ' RESPONSES IN RELA 


Response Categories 


Large towns Small towns 


only + small and/or large All other 
and large towns and responses 
towns countries 
Numbers in each category - 57 46 41 
Per cent. in G.I and G.IV 81% 54% 24% 
67% 37% 5% 


Per cent. in S.IT ..eeeeetettee 


IV.—COUNTRY AND NATIONALITY. 


(i) Spontaneous National Identification. 
interview proper started was “ What 
i in which children 


The first question asked when the 
ld introduce a bias; m 


are you? ” The object w 
tended to view themselves, before anything else cou 
particular to find out to what extent, if any, nationality entere 


commonly called children’s ‘ self-image.’ 

Results can be put very briefly : some nine-tenths confined themselves to 
their sex-role, i.e., they said “ I am a boy/girl.” Further probing brought out 
mainly qualifications of the statement already made (e.g-, “Tama schoolboy,” 
‘T am a good girl”), Or with older and/or brighter children, some super- 
ordinate categorization like ‘ person.’ or‘ human being.’ Out of the total sample 
only just over 3 per cent. described themselves as ‘ Scottish, all being from M 
schools. It may also be mentioned that the same proportion referred to their 


religion. . 
i This may be compared with the findings of sie ces ea 
der, i the same time, an using : 
g (1999) carried nit abon n a nationalities) aged 6, 10 and 14, 


Three sets of English children (among other c | 
respectively, were i Juded in the survey- Omitting the oldest, the combined 
pectively, were me. Ives English came to about 3 per 


Proportion of the others who called themse al 
t ah Ree This is, perhaps, somewhat surprising, as one 
ly greater 


ml almost exactly the same. nis rea would sh eae 
ight ha icipated that Sco tish children uld show a 
jes aca ible that such a difference would appear 


awareness of their nationality. It is poss! ; 
ee He i h children showed a great upsurge 1m the frequency 


Aes, as the 14-year-old Englis 
of giving their nationality. 
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Gi) General Inquiry about Country and Nationality. 


adults the phrase ‘ my country ’ has powerful emotional connota- 
eect the ones of ths a sock sentiments is merely touched upon 
in this investigation, which focusses primarily on cognitive development. It was 
concerned, however, with the emergence of understanding of the object to which 

iments refer. . ; 

sei Scotland or Britain had been mentioned by the investigator, 
children were asked “ What is your country ? ” It is apparent from Table 6 
that the youngest children often had no idea, some naming either their street 


TABLE 6 


DISTRIBUTION OF ANSWERS TO THE QUESTION ‘ WHAT Is YouR COUNTRY ?’ BY TYPE OF 
SCHOOL AND AGE. 


M Schools W Schools 


8-9 | 10-11] 6-7 8-9 | 10-11 


Don’t know or miscell. ” 2 12 
Glasgow 

Scotland 

Britain 


or local district. A frequent answer was ‘ Glasgow,’ and this may have been 
influenced by the preceding talk about the home town. With rising age, 
“Scotland ’ gained increasing prominence, whilst the small proportion who 
named Britain remained fairly constant. Differences between M and W schools 
were significant (P less than -01). The general pattern fits in well with the view 
that children begin with an awareness of their immediate locality, which 
becomes progressively extended outwards, pari passu with a sharperned 
discrimination concerning such geographical terms as ‘ town ’ and ‘ country ’ 
The question “ What is your nationality ? ” held few surprises. Among M 
children only one in the 6-7 group understood it, as compared with one-third of 
those aged 8-9 and three-quarters of the oldest ones ; corresponding figures for 
W children were, respectively, nil, a single one and one-third. Most of those 
who understood the question gave a correct reply, i.e., ‘ Scottish ’ or ‘ British,’ 


but a few said ‘ England’ or ‘ English.* The probable reason for such answers 
will be encountered shortly, 


(iii) Understanding of Relationships. 


After reference had been made to Scotland and Britain, children were 
asked specifically whether they were Scottish, and whether they were British. 


As is evident from Table 7, a majority of all ages gave an affirmative reply to 
at least one of these questions. Moreover, there is no significant difference 
between M and W, indicating that 


a verbal knowledge of nationality is acquired 
at roughly the same rate in both kinds of environment. Most of those aged 10-11 
acknowledged that they were both Scottish and British ; the problem is, how- 
ever, to what extent this corresponded to a grasp of the implicit part-whole 
relationship. 


* It will be remembered that children not of Scottish parentage had been excluded 
from the sample. 
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TABLE 7 


DISTRIBU © OF 
BUTION oF COMBINED ANSWERS TO THE QUESTIONS ‘ARE You SCOTTISH? ARE 


You BRITISH ?’ BY TYPE OF SCHOOL AND AGE. 


M Schools W Schools 


8-9 | 10-11 


‘No ' or‘ Don’t know ’ to both 


Scottish ‘Yes’ and British 
‘Don’t know ’ 
British ‘Yes’ and Scottish ‘No’ 
a Don’t know ' 
Yes ' to koth 


In order to test this, a follow-up was used with children who had said they 
were both, and with those who were positive about one but uncertain about the 
other, All these were asked about the possibility of being Scottish and British 
ited to explain their reasons. 


at the same time, and invi 
ructive. The most frequent 


the possibility was mutual exclusiveness—" you 
’ The other prominent feature, used as a 
In fact, language and nationality 


The outcome, summarized in Table 8, is inst 


reason offered for rejecting 
can’t be two different things at once.’ 


justification on either side, was language. 
were commonly mixed up, which probably led some of the children to say 


‘English’ when asked about their nationality, as mentioned earlier. A few 
quotations will illustrate the prevalent state of confusion, the first pair below 


having answered ‘ Yes ' and the second ‘ No’: 


your accent. (M8) 


British is your language and Scottish is 
d here in Scotland you still talk 


Down in England you speak British an 
British. (W10) 


I speak English—I’m really 
You can’t talk Scottish and 


Scottish. (M6) 


British at the same time. (W9) 


to be consistent, though obviously without any 
the last two, having declared previously that they 
were Scottish and British, had changed their position, yet there was no indica- 
tion that they were aware of it. On the other hand, a small number of children 
did show such an awareness. For instance, one decided “* You can t talk 
different ways,” and reverted to Scottish only ; another, possibly with a notion 
of Gaelic at the back of his mind, came to the conclusion that he was British 
but not Scottish because “ All the Scottish speak a different language to us ” ; 
one altered his answer from "No ’ to ‘ Yes,’ explaining “ Well—you can speak 
all the different languages.” Lastly, an 11-year-old, after an initial emphatic 

No,’ hesitated and finally reached the correct answer. This is an example of 
the interview as a learning situ the course of which new insights may be 


gained. 


_ The first two happen 
glimmer of understanding ; 


ation, in 
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TABLE 8 


IDEAS ABoUT THE RELATIONSHIP BETWEEN ‘SCOTTISH’ AND ‘ BRITISH.’ 


Can one be both ? Explanation given 


Don’t know 


“Can't be two things at same time’. 
Different language 
Other 


Same language . 
Other 


The miscellaneous answers contained some curious specimens. Some 
thought that Scottish and British were just different names for the same 
country ; one explained that his father was Scottish and his mother British, 
and a 6-year-old hit upon an ingenious solution : “ One week I'm Scottish and 
the next week I’m British.” 

The lesson of this follow-up is that 
cannot be taken at their face value. 
children said that one could not be 


replies to the nationality questions 
When challenged, about a third of the 
British and Scottish at the same time, 
thereby mostly contradicting their earlier responses and showing that these had 
not been rooted in a genuine understanding ; others continued to affirm the 
possibility of being both, yet without being able to provide an adequate 


Justification ; and less than half were able to demonstrate their grasp of the 
underlying relationship. 
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PSYCHO-LOGIC: A POSSIBLE ALTERNATIVE TO 
PIAGET’S FORMULATION 


By G. HARRY McLAUGHLIN 
(Northampton College of Advanced T echnology, London) 


Su MMARY. The limitations of children’s reasoning observed by Piaget can be 
described by a restricted logical system, called a psycho-logic. It is shown that 
psycho-logic can be simply represented by a small set of finite Boolean algebras. 
These are interpreted to suggest that limitations of reasoning correspond to 


limitations of immediate memory or attention. 


ed for the discovery that children behave as 
f a psycho-logic, meaning a logical 
d deductions to be made. He also 


Pracer (1947) is particularly not 
if their thinking conformed to the rules o. 
system too restricted to permit certain vali 
observed that a child's ability to reason progresses through an invariable 
sequence of four main stages, outlined below. He accounts for this sequence 
by suggesting that the development of higher stages of reasoning depends upon 
the child achieving concepts which are more and more abstract. 

_ Nearly all the generalisations on the development of reasoning given by 
Piaget (1937), Piaget and Inhelder (1941, 1959) and Inhelder and Piaget (1955), 
have been supported by numerous careful investigations, such as those fre- 
quently reported in this Journal by Peel and his co-workers. 

To describe the four stages of intellectual development, Piaget formulates 
four levels of psycho-logic. Each level is a logic so restricted that it yields only 


those inferences which can be made by a subject at the corresponding stage. 
Furthermore, Piaget advances the hypothesis that if, at any level, a formula 
at the end of the 


can be derived from another formula in psycho-logic, a subject I 
appropriate stage can carry out cognitive processes corresponding to both 
formulas. 

Psycho-logic has been 
aspects of it have been exhat 
summarised by Piaget (1947, 


set out in detail by Piaget (1949) ; particular 
istively treated by Piaget (1942, 1952) ; and it is 
1953). The last-mentioned volume has an 
Introduction by Mays explaining with admirable clarity the logical terms used 
1n psycho-logic. Unfortunately, as Beth (1950) noted, Piaget’s own expositions 
of his system contain serious confusions ‘and are consequently hard to follow. 
Grize (1960) has expressed the system in a rigorous form, but this is even more 
formidable to the non-logician, and few of its terms appear to have psycho- 
logical referents. 

_ The difficulty of Piaget’s system seem 
directly into logical terms the psycholo 
abstractness. This gives the system a struct 


ventional logics. i 

However, it will be shown that a psycho-logic can be adequately formu- 
lated in terms of the algebra of classes, which is explained in most introductions 
to modern logic (e.g., Langer 53). It will also be shown that each level can 
be simply defined by the number of different classes that are distinguished 
Simultaneously. Furthermore, this number can be directly jdentified with a 
clearly measurable psychological characteristic—memory span. 


s to stem from his attempt to translate 
gical notion of increasing conceptual 
ure different from that of con- 


6I 
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Briefly, the argument is that a subject solving an intellectual problem must 
retain a number of items at the same time, the number depending on the 
logical complexity of the problem ; and that the number of items a child can 
retain simultaneously increases as he grows older ; therefore, as the child matures, 
he can solve more complex problems. In other words, each of the four stages 
in the development of thought is characterised in part by the number of items 
that a child can retain at the same time. It should be noted that increases in 
retentivity are claimed to lead to changes only in the logical aspect of thinking : 
no doubt changes in the quality of thinking in its other aspects involve further 
factors. Furthermore, an ability to retain a certain number of items is claimed 
only to be a necessary condition for making certain kinds of inference : it is not 
proposed as a sufficient condition. 

There are at least two ways in which the number of items retained simul- 
taneously could increase with age. The older child might either be able to 
remember more items after a single presentation, or be able to attend to more 
items while they are being presented. Even more probably, memory capacity 
and attention capacity increase together, perhaps because they are two aspects 
of the same process. Tor convenience of exposition, a child’s growing ability 
to consider a certain number of items simultaneously will usually be treated 
simply as a function of enlarging immediate memory capacity, but it must be 
emphasised that this notion is not fundamental to psycho-logic. 

The size of memory span is greatly affected by redundancy of the stimulus 
material. That is to say, if an item in a list to be recalled is deducible from other 
items, a subject’s memory span will be increased by that redundant item. 
However, we will be concerned with items that are by definition non-redundant : 
these G. A. Miller (1956) has named chunks. Each digit in a digit span test 
forms a chunk, unless the subject is capable of mentally combining the digits 
into groups. Thus, from the norms of digit span tests (e.g., Wechsler 1949, 1955) 
we can say that the immediate memory can simultaneously retain two chunks 
at about 2} years of age, four by 44, and seven by 134, falling to six at about 
55 years of age. Thus, immediate memory capacity increases and declines in 
the same way as intelligence, correlating between -42 and -62 with full Wechsler 
Intelligence Scale scores. The reason why the correlation is no higher may be 
that, although a given stage cannot arise until the capacity to retain enough 
chunks has developed, it is not consistently manifested until much later, when 
the child has learnt to combine and separate the chunks mentally. 


Let us call that part of his environment which a sub 


} ject is considering at any 
moment his universe. 


This universe can be characterised by the algebra of 
classes. In this logical system, any collection of entities, physical or mental, is 
called a class. A class can consist of a single entity. A class with no members 
(e.g., the class of unicorns) is said to be empty. There can also be classes formed 
of classes. If A and B are classes, their logical product is the class of entities 
in both A and B ; and their logical sum is the class of entities in A or B or both. 
Classes which are parts of a larger class are called its subclasses. 
class comprises all the classes relevant to a given proposition. The complement 
A’ of a class A consists of all entities in the universe class that are not in A. 

_ We may now define a subject’s concept of a class as the specification by 
which he decides whether or not an entity is in that class.* To put it another 

_,” Again for simplicity of exposition, it is assumed that a subject categorises a given 
entity with certainty as either being or not being ina given class. If, however, he categorises 
an entity as possibly being in a given class, the argument of this vaper still stands, but the 


i A S P 
logical apparatus required is the modal calculus noted by Lewis and Langford (1932, p. 501) 
as being equivalent to that discussed here, ` 


The universe 
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way, a subject's concept of a class consists in his k its distinguishi 
at itas à sists in his knowledge of its distinguis 
eg ee psychology shows that we can pes single nite nae 
the ATE ity or class of entities with any number of attributes. However, 
many e y selectively attending to any combination of attributes, forms as 
big white a s a there are combinations to consider. For example, if he sees a 
Stig (hin yan, AE. may apprehend the object either under the concepts ‘ swan,’ 
‘hig we) w hite (thing),’ ‘ big white (thing),” ‘ big swan,’ ‘ white swan * or 
‘es? a hes Furthermore, the subject might restructure the concepts 
DaT a lack’ to form a concept of their product, which would be 
leaner y some such remark as Se ‘Are there black swans?” He might also 
for whict + t ne concepts ‘ swan ’ and ‘raven ’ to form a concept of their sum, 
pt n he would probably know the name ‘pird.’ By selective attention to 
spects of the concept of a sum or of a product, the subject could again form the 


component concepts ‘ swan,’ * black,’ ‘ big,’ etc. 


duces to solving a problem of the type: given 
soogty? Atthevh what other classes are or are not 
mie ba At the w orst, all the classes defined by the attributes of the given classes 
Hae e considered. The maximum number of classes that might need to be 
; onsidered is easily calculated. Suppose the given classes are distinguished by 
s number, N, of different attributes (or sets of attributes that occur together), 
hen each relevant, distinct class is defined by one of the 2N 
tions of the presence or absence of those attributes. For examp. 


fiven by the premisses of a syllogism vary in three attributes, so finding a con- 
clusion might invol In practice, many prob- 


l ve considering up to 2=8 classes. 
ems are solvable without considering all the classes that might be relevant. 
E As Bernstein (1935) has shown, the importance of N is that for each value 
tl is given, it specifies a unique logic, called a finite Boolean algebra. We will, 
nerefore, label each level of psycho-logic by the appropriate value of N—which 
must be a positive integer, unless it is 0, as will be explained below. 
n-redundant, the 


y N attributes are no. 
chunk. Therefore, the level at which a 


an be inferred from his digit span, always presuming that 
an for concepts: the subject will be at level 
3 or 4; and at level 3 if it is from 5 to 8. 


pras would still be valuable as descrip- 
e various psychological stages, even 
if stages and memory span d. But if there were no corres” 

a certain memory span is a pre-requisite for 


Pondence, the hypothesis that I 
reasoning at a certain level would have to be discarded. However, a search 
t of that cited by Munn (1954, p- 412 ff.), has failed 


of the literature, including mos l 
to reveal evidence of children making inferences while not having the memory 
Spans required by the hypothesis. Indeed, remembering that memory capacity 
is only a necessary condition for reasoning at a certain level, and that success 
cannot be assured without the maximum capacity required at that level, it will 
be shown that there is a satisfactory agreement between the ages at which the 
Various memory spans are attained and those at which the corresponding 


levels begin. 


co The following sketch of levels ar 
E nstructs that are clearly supporte 
ructs are italicised. 


Because the 2N classes defined b 
concept of each class corresponds to a 


tions of the logical status 0 


those of Piaget’s hypothetical 


nd stages uses 
ms for these con- 


d by empirical data ; ter 
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LeveL 0. This demands the capacity to process 20 1 concept at a time. 
It corresponds to the stage of sensorimotor intelligence. This is defined by Piaget 
(1937) as the period up to two years of age, during which a baby learns to form 


object concepis, that is, to treat a given object as the same object however, and 
whenever, it is perceived. 


We may suppose that some conditioning process leads the baby to categor- 
ise identically percepts similar in important respects. It is not known exactly 
how object concepts are attained. Piaget (1957, pp. 86-9) has suggested that a 
baby twiddling some object around finds that there is an invariant in the group 
of spatial displacements he has performed upon the object, and thus learns that 
the object is permanent : yet it must be pointed out that any finite number of 


twiddles may not form a group, and how invariants are extracted from percep- 
tion remains unexplained. 


For present purposes, it suffices to remark that the baby can make no 
inferences. This is due to the fact that an inference, by definition, involves 
relating at least two concepts, and at level 0 only one concept can be differen- 
tiated at a time. It may now seem paradoxical to suggest that the baby’s 
object concept has N=0 distinguishing attributes. The paradox vanishes, 
however, when it is noted that if he has only one concept in mind there is no 


other concept to distinguish it from, therefore, distinguishing attributes are 
meaningless in this context. 


_ Lever |. This level demands the capacity to retain 2!\=2 concepts 
simultaneously. It corresponds to the stage of pre-operational intelligence. 
This lasts from tw 


o to seven years of age, and is defined by P 


; J i iaget and Inhelder 
during which the infant comes to form cla 


ss concepts, that is, 


(1941) as the time 
to classify objects. 


At first the infant may fail to note the distinguishing attributes specified in 
an adult’s concept of a given class, so he may make classifications which adults 
think mistaken. To overcome this he will, presumably, try to learn the dis- 


tinguishing attributes of all manner of classes ; this he seems to be doing when 
he asks a series of questions of the form: ‘ What’s that? A cow? Why? 
What’s it for? ” 


A more serious source of confusion is the infant’s inability to distinguish 
between more than an object and its environment. He cannot at the same time 
note the relationship between the object and its environment, because that 
would necessitate the simultaneous retention of a third concept. But knowing 
the relative position of an object is essential to deciding whether percepts of 
similar objects encountered at different times should be classified as a number of 
objects of the same class, or as the same object. Piaget (1937, p. 374) has 
observed that infants are often in doubt on this score. 

On the other hand, the infant’s capacit 
enables him to draw comparisons, which in 
mentary form of inference. If he is confront 


retain a concept specifying those of its att 
If he now encounters some 
object. But as he can sim 
classify his concepts of the 
quite different : he cannot 


y to retain two concepts at once 
turn permits him to make a rudi- 
ed with an object he can form and 
tributes to which he has attended. 
new object, he can compare it with the retained 
ultaneously retain two concepts at most, he must 
two objects as being either identically the same, or 
have a third concept of their similarities and differ- 


Toa rei, if the new object has at least one of the attributes of the 
rE ie on ge P may expect to find that it has all the other attributes as 
well. Stern (1924) calls this kind of reasoning from one concrete instance to 


a 
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another ‘ s er a z 
3, a — : it is exemplified by his daughter who, at the age of 
rope strung a by eee to support the family wash, so when she saw a 
Something to pi ce Dalepnys ahe asked: “ What’s that hanging up there? 
wink at this stage is also characterised by the phenomenon of irre- 
some iit b = exemplified by an infant who is shown some brown beads and 
Seodea aan ne s. He will correctly tell a questioner that the brown beads are 
talhias] c : hat the white beads are wooden. But, if he is shown a box con- 
holds A A: naa number of brown than white beads, and he is asked whether it 
EaR ero beads or more wooden beads, he will insist that there are more 
ZOnE ae s. This would be predicted of an infant who cannot think of three 
beade simultaneously : for, if he has already classified the white wooden 
Reale oa , and the brown wooden beads as ‘ brown,’ or even ‘ brown 
+ » he has not the capacity to restructure his concepts to form a third 
mcept ‘ wooden of any colour.’ 
PM ics 2. This level demands the capacity to 
This taneously. It corresponds to the stage of concre 
pening pennen by Piaget and Inhelder (1959) as the time, between 7 and 11 
pro aC age, when the child learns how to use concrete operations, which are 
cesses of restructuring concepts, that can be illustrated by grouping tangible 


objects. 


. Suppose that a 
Considers the attributes of 
concept of the sum of A and A’. 
ane Now the child’s immediate memory can acco 
urther class, B’, which can in turn be coupled wit 
Proges; called the concrete operation of addition, may be continued indefinitely. 
ry is manifested when the child couples classes of objects to form a wider class. 
enon is also manifested when the child puts objects into a series ordered 
Y some specific relation. This is now possible because the child has the capacity, 
not only to retain concepts of each pair of objects involved, but also the further 
concept of the relations between them. 
The notion that the child forms wider class concepts by combining concepts 
of subclasses two at a time is similar to that advanced by Piaget (1942 5 1947, 
P. 43 ; 1949), who states that a child forms the concept of a class (e.g., animals) 
Y combining the concept of a subclass (e.g., insects) with the concept of the 
complement of that subclass (non-insects). But the complement seems to be 
defined with respect to the new class (thus making ‘ non-insects ’ mean ‘ animals 
hat are not insects’). This would imply that the child must already have the 
concept of the new class in order to form that concept—@ circularity which 
obviously cannot be intended. 
t Next suppose that the child has in his c 
wo classes C and D. Then the child can classify further concepts using, at the 
Same time, the two criteria, “having the attributes of C’ and ‘having the 
attributes of D.’ This is manifested in the concrete operation of multiplication, 
at is, in sorting objects into four product classes according to whether they 
have the attributes of C and D; of C but not of D ; of D but not of C; orof 
neither C nor D. Multiplication is also manifested when a child finds corres- 
Pondences between two given independent series. 
t The delay between the attainment of a four-chunk memory span, between 
he ages of 4 and 5, and the start of the concrete operational stage, at about 7 or 
years, is more apparent than real. In fact, Piaget and Inhelder (1959) give 


process up to 22=4 concepts 
te operational intelligence. 


child has formed concepts of two classes, A and A‘. If he 
A and A’ simultaneously, he thereby forms the 
This new third concept may be designated 
mmodate the concept of a 
h the concept of B. This 


immediate memory the concepts of 
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statistical tables which may be summed up by saying that no 4-year-olds can 
perform concrete operations, but that some can be carried out systematically by 
an increasing number of subjects from the age of 5. Evidence of the early 
performance of concrete operations is also given by Welch (1940) who, incident- 
ally, shows the relation between the abstractness of concepts and their number. 
Welch distinguishes stages in the development of abstract thought by the 
hierarchic order of concepts a child can form. At 26 months, on average, his 
subjects attained first-order concepts : for example, potatoes and apples were 
both classed as foods. The second stage was reached at 44 years, when his 
subjects attained second-order concepts, potatoes being classed as vegetables, 
apples as fruit, and both fruit and vegetables as foods. 


Concrete operations are said to be reversible, because the child can behave 
as if he had mentally cancelled an operation, either by doing the opposite (a 
type of reverse operation called inversion), or by performing an operation that 
compensates for the first one (reciprocity). Thus, for the first time he can argue 
that if lemonade is poured from a tall, slim glass into a short, fat one, there 
will still be the same amount to drink, because the lemonade can be poured back 
again to look as it did at first (inversion), and because decreasing the lemonade’s 
depth has been compensated for by giving it more breadth (reciprocity). 


At this stage, however, the child cannot understand how inversion and 
reciprocity can be combined and so effect correlation. For example, the child 
may be faced with an apparatus consisting of two communicating vessels 
containing a liquid ; if a piston is pressing on the liquid in one vessel, the level 
of the liquid in the other vessel will vary according to the weight of the piston 
and the density of the liquid. A child, at this stage, cannot educe the correlate 
needed to answer the problem : “ The operation of increasing the weight of the 
piston is to decreasing the weight of the piston (inversion) as increasing the 
density of the liquid (reciprocity) is to... what ? ” (Inhelder and Piaget, 1955, 
p. 148 ff.). The hypothesis of an immediate memory limited to holding four 
concepts at a time accounts for the child’s inability to educe such a correlate, 
for to understand the problem he would have to retain not only the concepts of 
an operation, its inversion and its reciprocity, but also concepts of the three 
relationships involved. 


The hypothesis also explains why it is not until the next stage that a child 
can remember to keep all factors constant except one during an investigation 
(Inhelder and Piaget, 1955, p. 46 ff.) 

LeveL 3. This level dem 
simultaneously. This corres 
which is reached by 11 to 1 
with a fully practical logic 


If the adolescents can now retain up to eight concepts, he can, in principle, 
make any inference involving classes distinguished by up to three attributes 
(or sets of attributes), such as a syllogism, so the limitations of the previous level 
vanish for nearly all practical purposes. 


Indeed, it will be an unusally complex problem that demands the retention 
of as many as eight concepts. The conclusion to the simplest form of syllogism 
(known as Barbara) can be found even with level 2 logic by a process akin to 
logical addition : the first Premiss says that all of class A has the attributes of 
class B, so that the concept of their sum, A and B, can be formed and used to 
substitute for A or for B ; the second premiss says that all of class B has the 
attributes of class C, so that class A has the attributes of class C, which is the 


ands the ability to process up to 23=8 concepts 
ponds to the stage of formal operational intelligence, 
5 years, when the adolescent thinks in conformity 


iog 
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requir SIE EE . ee 
=e adh eae However, if this kind of reasoning is applied to other 
ihe can = i may produce invalid conclusions. In fact, most fallacies in reason- 
E ior he c comia for on the hypothesis that the problem being considered 
aeae men oo of more concepts than can be kept distinct in the 
dian 4 logic, which would enable subjects to solve problems involving 
Tare + el arying in four different ways, is, of course, possible. However, itisa 
thew oe n who is intelligent enough to solve such problems mentally, because 
y require the simultaneous retention of up to 2'=16 concepts. 
e due to Professor E. A. Peel and Dr. 
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PREDICTING STUDENT PERFORMANCE IN A 
UNIVERSITY DEPARTMENT OF EDUCATION 


By F. W. WARBURTON, H. J. BUTCHER anp G. M. FORREST 
(Department of Education, University of Manchester) 


SUMMARY. One hundred teachers in training in the Department of Education, 
University of Manchester, were tested with measures of abilities, personality, 
interests, values and general culture, and provided a considerable range of 
biographical information. The predictive power of these various measures was 
examined in terms of three main criteria—final teaching mark, final theory 
mark, and award of the Graduate Certificate in Education. Degree class was 
the best single predictor of theory mark and of final award. A personality 
questionnaire (Cattell’s 16 P.F.) was the best single predictor of teaching mark. 
The successful student tended, apart from academic qualifications, to have a 
high level of general culture, high conscientiousness (16 P.F., factor G), to 


have tender-minded attitudes to education, and to participate fairly widely in 
social activities. 


I,—INTRODUCTION. 


Tue literature dealing with the prediction of success in teaching and success in 
teachers’ training courses is very large and very inconclusive. Some idea of its 
extent may be obtained from the annotated biography of Domas and Tiedeman 
(1950) which contains over 1,000 references to papers on ‘ teacher competence ’ 
up to 1950. Its inconclusiveness is also well known. As Evans (1959) says : 
“The most obvious deduction from the available evidence is that there is no 
one pattern of successful teacher. Teaching is a complex process calling for 
many different abilities, no one of which is by itself sufficient to ensure success. 


At the same time, each of these abilities makes a small but necessary contribu- 
tion to success ” (p. 33). 


Among the variables which have been previously studied as possibly 
related to teaching success are : biographical data such as age, marital status 
(eg., Ryans, 1951), intelligence and abilities (in a large number of studies), 
personality factors measured by questionnaire or by objective tests (Lamke, 
1951; Schwartz, 1950), interests (Strong, 1943), attitudes to teaching (Cook, 
Leeds and Callis, 1952) and many others. In addition to the study of theoretical 
predictors of this type, much research has been devoted to the predictive power 
of interviews and kindred methods of selection (e.g., Burroughs, 1958). 


The results have, in general, been rather disappointing. Several causes 
have clearly contributed to the prevailing uncertainty in this field. Firstly, as 
stated by Evans in the quotation above, the criterion is a complex one. Secondly, 
in some studies the criterion has been teaching mark at student level, in others 
rated success as a teacher after varying periods of service, Little is known about 
the relation between these two criteria, as is pointed out by Collins (1959), 
since the few correlations available range from + -84 (Tudhope 1942) to -+--16 
(Martin, 1944). Thirdly, the numerous variables which have been tested as 
‘predictors have generally been tried out on quite different samples, varying in 


age, nationality, sex, degree of selection and other res ith t ; t 
+O ion a pects, with the result tha 
it is hard to compare the results obtained and the relative values of the different 
types of predictor. 
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n for the present paper, for it 


These last two factors are our justificatio 
lection of data of all the types 


pa Hie administration of tests and the col 
ain me i yat one sample of students in a typical university department 
ol graduate A though the present research uses the teaching practice marks 
sainple = Stud ents as its principal criterion, it is also planned to follow up the 

after five years’ teaching experience, and thus, firstly, to provide further 


evid A : 
ey ip about the relation between practice marks and later success, but also, 
portantly, perhaps, to determine which of the predictors of student 
when considered against the 


teachi $ 4 
Semper hold up, and which deteriorate, 
and more fundamental criterion of success in teaching as a career. 


K II.—DETAILS OF THE RESEARCH. 

(a) Objectives. 
a a programme of research thus has three main aims: (i) to obtain as 
ou ete a picture as possible of a sample of graduates entering the teaching 
ession in terms of biographical and academic background, abilities, interests, 

the relative efficiency of these 


vi 7 : A A 

ue and personality factors ; (ii) to determine 
iables in predicting success in the course of professional training ; (iii) to 
teaching experience and re-determine 


fi : 
reaped up the subjects after five years’ 

e relative predictive values in terms of success in teaching as a career. The 
present paper is mainly concerned with the second of these aims. 


(5) The subjects. 
” Students taking the Graduate Certificate in Education course in the Depart- 
Bar of Education, University of Manchester, in the session 1957-58, numbered 
lost Volunteers were called for to undergo an extensive programme of psycho- 
ti gical testing, and 118 attended the first period of testing, but inevitably by the 
ime data on approximately 100 variables had been assembled, a number of 
gaps had appeared. The number of students on whom information on every 
variable was available had shrunk to rather less than 100. However, to facilitate 
pe tae by the electronic computer, and to avoid the complication of slightly 
ifferent numbers in the sample in different variables, the gaps were filled by 
assigning to a student who had missed a test, or about whom information was 
Missing, the mean score of the whole group 


on that particular variable. This 
expedient was necessary in 2-3 per cent. of the entries in a 100 x 100 score matrix. 


(c) The variables. 

i The variables included in th 

to enty-five biographical pa ticula 
aining course, as follows : 


e research comprise fifty-seven test scores, 
ia of success on the 


rs and eighteen criteria © 


TABLE 1 
(1) TEST SCORES. 
1—Non-verbal intelli Free Scale 3A (Cattell 
h: al intell e. Culture Fre (Ca! Ns 
ecg National Institute of Industrial Psychology- 
Group Test 33. 

3—Spatial ability. NTN Institute of Industrial Psychology. 
4 Group Test 80A. 

5 Memory. Primary Mental Abilities. Thurstone. 
—Verbal Fluency. Primary Mental Abilities. Thurstone- 

Mechanical Comprehension. Bennett. 


E A r 
Mechanical Comprehension. 
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7—Vocabulary. 


8—Gencral Culture, 


9—Cyclothymia vs. Schizothymia. 
10—Intelligence. 

11—Ego Strength. 
12—Dominance. 
13—Surgency. 
14—Conscientiousness. 
15—Extraversion. 
16—Sensitivity 

17—Paranoid Tendency. 
18—Unconventionality. 
19—Sophistication. 

20—Guilt Proneness (Anxiety). 
21—Radicalism 
22—Self-sufficiency. 
23—Self-control. 
24—Id Pressure (Tenseness). 
25—Outdoor Interests. 


26—Mechanical Interests. 
27—Computational Interests. 
28—Scientific Interests. 
29—Managerial Interests. 
30—Artistic Interests. 
31—Literary Interests. 
32—Musical Interests. 

33— Social Interests. 
34—Clerical Interests, 
35—Theoretical Values. 


36—Economic Values. 
37—Aesthetic Values. 

38—Social Values. 

39—Political Values. 
40—Religious Values. 

41—Radical attitudes. 
42—Tenderminded attitudes. 
43—Radical attitudes to education. 


44—Tenderminded attitudes to education, 
45—Naturalist attitudes to education. 
48—Accuracy in non-verbal intelligence 
test. 
49—Accuracy in verbal intelligence test. 
50—Accuracy in spatial ability test. 
51—Accuracy in mechanical ability test. 
52—Accuracy in vocabulary test. 
53—Accuracy in general culture test. 
54—Emphasis in personality test, 
55—Emphasis in values test. 
56—Emphasis in social attitudes test. 


57—Emphasis in radical a! 


j ttitudes to educa- 
tion. 


Predicting Student Performance 


Primary Mental Abilities. Thurstone (70 
items), together with 30 difficult words 
from various other sources. 

100 item test, constructed by H. J. Butcher 
and F. W. Warburton, Department of 

Education, University of Manchester. The 

subject is required to give the name of the 

authors of a list of books, covering the 
humanities, sciences and social sciences and 
works of art and music. 

P.F. Factor A (Cattell) 

» Factor B 
Factor C 
Factor E 
Factor F 
Factor G 
Factor H 
Factor I 
Factor L 
Factor M 
Factor N 
Factor O 
Factor Q, 
Factor Q, 
Factor Q, 
» Factor Q, 
Kuder Preference 

Form CH. 


Record-Vocational 


Study "of Values 
Lindzey). 


” ” 


Inventory of Social Attitudes (Eysenck). 


Survey of Opinions about Education (Oliver), 
Department of Education, University © 
Manchester. 


” ” ” ” 


Proportion answers correct in Test 1. 

Proportion answers correct in Test 2. 

Proportion answers correct in Test 3. 

Proportion answers correct in Test 6. 

Proportion answers correct in Test 7. 

Proportion answers correct in Test 8. 

No. ‘ extreme ' answers in 16 P.F. Test. 

No. extreme answers in study of Values test., 1 

No. extreme answers in Inventory of Socia 
Attitudes. f 

No. extreme answers (radical) Survey ° 
Opinions about Education. 
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58—E: isi A P 
men u tender-minded attitudes No. of extreme answers (tender-minded) 
59—Emphasi ation. p , Survey of Opinions about Education. 
asis in naturalist attitudes to No. extreme answers (naturalistic). Survey 
of Opinions about Education. 


education. 
60—N (2) BIOGRAPHICAL PARTICULARS. 
No. of G.C.E. ‘O 'or' A’ subjects. No. of subjects passed either at ‘O ‘or ‘A’ 
level in General Certificate of Education, 
i.e., subjects in which at least ‘ O’ level is 
61—N KES , reached. 
No. of G.C.E. ‘A’ subjects. No. of subjects passed at ‘A’ level in General 
62 Desra Certificate of Education. 
gree class. First class honours=5. 
Second class honours (upper division) =4. 
Second class honours (lower division) =3. 
Third class honours=2. 
Pass degree=1. 
No degree=0. 


Statescholar=2. 
Otherwise grant aided =1. 
No grant=0. 
Any form of teaching, €g. 


Questionnaire. 
supply, Sunday School, armed forces=1. 


No teaching=0. 


63—Grant. 


(ee i P 
4—Previous teaching experience. 


65—Degree course tutor’s reference. A-E scale. 
66—Interview grade. Mean grade of 2-3 individual interviewers 
67 EON working separately. A-E scale. 
—Social status of school. Independent School=2. 
Direct Grant=1. 


63 Grammar School=0. 
Age of university. Classified as follows : 
Oxford=2. 
Cambridge=2. 
London=1. 
Manchester=1. 
Liverpool=1. 
Leeds=1. 
Sheffield=1. 
Nottingham=1. 
Durham=1. 
Southampton=0. 
Exeter=0. 
North Staffordshire=0. 
‘Aberystwyth=0. 
69—Arts Faculty. were r i 
ocial Science=1. 
Science=0. 


70—A, 

ge. 

71—Male Sex. - 

72—No. of University offices. Questionnaire. No. given. 

73—No. of University societies. uestionnaire. No. given. 

ae of interests. Questionnaire. Ho: Pr 

—No. of games. Questionnaire. PET Aae A 

76—University residence. University Hall of Residence=2. 
Lodgings=!- 

7 Home=0. 

7—Father’s education. University =5. 
Training College=4. 
Public School=3. 
Grammar School, High School=2. 
Technical School, Commercial School, Trade 
School=1. 

Elementary School=0. 

78—Mother's education. ‘As ‘ Father's education,’ 
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79—Parent’s occupational level. 


80—Large family size. : 
81—Late position in family. 
46—Religious affiliation. 


47—Right political affiliation. 


99—Entry grade. 


91—Final Teaching Mark. 


97—Final Theory Mark. 
98—Final Certificate Award. 


82—Tutor's Teaching grade (Michaelmas 
Term)’ 


83—Tutor’s teaching grade (Lent Term). 


84—Schools’ teaching grade (Michaelmas 
term). 


85—School’s teaching grade 
86—Tutor's teaching check li 
mas term). 


(Lent term). 
st (Michael- 


87—Tutor's teaching check list 
88—Extra Tutor's 


89—School’s teachi 
90—School's teachi 
92—Psychology of 


(Lent term). 
teaching check list, 

ng check list (January). 
ng check list (April). 
education examination. 


93—Philosophy of education examination. 

94—History of education examination, 

95—Health education examination, 
6—Essay. 


100—Curriculum and Method examination. 


(d) Particulars of the main criteria, 

_ The prediction of 
be in terms of three criteria, 
(c) award or non-award of t 


Certificate in Education of t 


A-E scale. 


teaching success to be re 


(a) final teaching mark, (b) 
he certificate. 


Predicting Student Performance 


Grades I-V. 
(H.MLS.O.). 
No. of children. 

Ist, 2nd, etc. 

Questionnaire. 

Fundamentalist Sects=5. 

Nonconformists=4. 

Roman Catholics 

Church of England=1. 

No religion=0. 

Questionnaire. 

Conservative=4. 

Liberal=3. 

Labour=1. 

Communist =0. 

A-E scale. A mark given before entrance to 
the Department of Education. It is based 
on the applicant's complete record, includ- 
ing examination, tutor's reference, inter- 
view grades and life record, 


Classification of Occupations 


(3) CRITERIA oF Success. 


A-E scale. Graduate Cer 


tificate in Education 
examination. 


A-E scale by tutor supervising teaching 
practice. 

A-E scale b 
practice, 
A-E scale by headmaster of school in which 
student’s school practice was carried out. 


y tutor supervising teaching 


Check list of 43 items concerned w 
student's disciplinary control, 
struction, attitude to pupils, etc, 


ith 
lesson con- 


” ” ” ” 
” ” ” ” 


Graduate Certificate in Education 
examination, 


final theory mark, 
All three refer to the Graduate 


— 
R 
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Final teaching mark is based primari i i 
ae |s primarily on a tutor’s observations and 
La of a student during about twelve weeks of school practice, although 
eg ‘pee the schools and from an external examiner will, in many cases, 
oan - o affect it. Final theory mark is based on the average of five exam- 
ihe roe per menn one long essay. Award of the Graduate Certificate represents 
sal fe on of a group of examiners on receiving reports of final teaching mark 
nal theory mark, together with the testimony of examiners who know the 


student. 
II].—THE PREDICTION OF SUCCESS AT TEACHING. 
(@) Prediction from individual measures. 


care All significant correlations obtained between any of the tests or biographical 
ariables and the three main criteria are shown in Table 2 below : 


TABLE 2 


D THE MAIN CRITERIA OF 


SIGNIFICANT CORRELATIONS BETWEEN THE PREDICTORS AN: 
SUCCESS. 


Criteria 
Predictors Final Final Certificate 
Teaching | Theory award 
Mark Mark 
TEsTs : 
J—Vocabulary. a ss emne peace tis Semewes eina se — +236 = 
8—General Culture a = -399* -308* 
11—Ego strength (16 P.F.) = -194 Š$ 
14—Conscientiousness (16 P. -236 -236 -198 
16—Sensitivity (16 P.F.) .. -220 —- “215 
18—Unconventionality (16 P.F. — -199 — 
23—Self-Control (16 P.F.) --+-- 223 = Se 
36—Economic Values (Vernon) .--- — —:213 —:257* 
44—Tendermindedness in education — +199 “210 
49—Accuracy (verbal intelligence) — -203 —: 
52—Accuracy (vocabulary) +--+. 2777" a4 — +258* +231 
54—Extreme responses to personality question- 
NBE ean SrCe seis en oe — —:275* — -204 
57—Emphasis on scale of radicalism in education. - — :262* -246 
Bro : 
Pra pene eee _ | 24, | ae 
62—Degree class : +238 :569* 514* 
63—Grant ...--+++++° TA eine eee 
65—Degree Course, Tutor’s 2 $ j 
66—Taterview grade .---+-+eseee es . :265* :289* :254* 
72— No. University offices held : a = ken E 
74—No. of interests : ` oo ae 2 oe 


above table are significant at the -05 level, except 


All the correlations in the 
k, which are at the -O1 level. 


those marked with an asteris' 
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(i) Prediction from Psychological tests. 


Table 2 shows that of the 171 correlations between test scores and the 
three main criteria, only twenty-two (13 per cent.) are significant. This may 
seem a small number, but it must be remembered that we would not necessarily 
expect a large proportion of variables to be related to success in teaching, or 
in any other particular profession. As regards the type of test, significant 
correlations were obtained for 17 per cent. of the cognitive test scores, 18 per 
cent. of the personality measures, 0 per cent. of the interests, 10 per cent. of the 
values and 15 per cent. of the attitudes assessments (22 per cent. for attitudes 
to education). There is an important reason, however, why the size of the 
correlation coefficients should not be taken quite at its face value. This is the 
familiar problem of correction for range. Although it is impossible to state the 
extent to which our sample of graduate students forms a selected group com- 
pared with the total group of successful and unsuccessful applicants for entry 


to the course, since the degree of selection and consequent narrowing of range 
will be different for every variable, ther: 


e is little doubt that if the tests and 
predictors had been administered to all candidates, the majority of the correla- 
tion coefficients would have been markedly higher, and many of those that are 


at present only suggestive would become statistically significant. A striking 
example of the effects of correction for selection in a rather similar situation is 
provided by Vernon’s (1950) study of Civil Service Selection procedure, in 
which all the separate, uncorrected validities are extremely low, but, when 
corrected for selection and combined, yield a considerable degree of prediction. 
On the whole, the results indicate the relative success of two single long tests, 
Cattell’s 16 P.F. personality test, and Oliver’s “ Survey of Opinions about 
Education,” and the relative failure of the cognitive tests and the measures of 
interests. For success on the theoretical side of the course, culture and vocabu- 
lary are important, suggesting that Arts students, with their literary and verbal 

Ego-strength, conscientiousness 


background, have an advantage in this respect, 
and lack of extreme personality characteristics are also important, perhaps in 
the preparation of examinations. There is also some indication that tender- 
mindedness, radical attitudes and originality (unconventionality) are a factor in 
examination success, i.e., in pleasing the examiners, i 

The ‘ General Culture Test ’ emerges as a better predictor of theory mark 
than any other single test. This unpublished test (Butcher and Warburton) is 
prima facie, a rather crude Measure of extent of reading. It consists of 100 
items, the large majority of 


3 l these being the titles of books, with the respondent 
being required to name the author. In the remainder of the items, the same 


principle is applied to well-known paintings and musical compositions. 


, Turning to the variables that do not si 
noticeable that all six of the ability tests (variables 1-6) fall into this class, 
presumably because of the high gen i 
range, of the sample. t whereas one at least of each type 
of test correlates with and hence with the Certificate 
Award), only Personality scores are related significantly to practical teaching. 
The finding that Practical teaching is harder to predict than theoretical success 
3s typical of research in this area, The personality qualities related to Practical 
Teaching are self-control (Q3), conscientiousness" (G) and sensitivity (I), This 
finding is in agreement with an American research by Erickson (Barr, et al. 
1961), who administered the 16 P.F. to a group of sixty-four teachers in Wisconsin 
High Schools, Erickson found factors Qs, G and M- to be related to teaching 


ys 
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Sueese E 5 r 
pe with obtained correlations which were all of the order -2 to -3. The 
stabilit p annes are aspects of the major second order personality factor of 
do T w hile sensitivity is related to tendermindedness, an attitude which is 
rainig TERENON this study to be positively correlated with success on this 
in ve et i e. This is not a surprising result in view of contemporary trends 


Positive correlations betweer i 

A t a s een factors G and Q, and teaching success are 

iat in the 16 P.F. handbook, but the correlation with factor I (Emotional 

i ha is a new finding which may, however, be related to cultural differ- 
es between the U.S.A. and Britain. Certainly, both subjective observation 


and empirical research (e.g., Cattell and Warburton, 1961) suggest that this 
is both more highly valued by and 


a of sensitivity or tendermindedness 15 

ee of the European than of the North American culture-pattern. 
a value of the 16 P.F. test in the present context appears to be that it alone, 
among the wide range of tests used, predicts teaching practice better than 


teaching theory. 


Besides the significant correlations, it is interesting to note those that, 


perhaps against expectation, are of negligible size. In particular, verbal and 
non-verbal intelligence are irrelevant to teaching success in this sample, except 
in so far as non-verbal intelligence shows a negative (but non-significant) 
correlation of —'125. Similarly, the ex f scores provided by the 


tensive series 0 
Kuder Preference Record and the Allp 


: ort-Vernon measures of values fail to 
yield any significant correlation, although one or two are suggestive of some 
relationship. 


Non-significant but suggestive correlations with teaching mark are shown 


below : 
Åge pith canna sa ar Reg Ce a + +173 
No. Gf GOE. passes est testes seine NIE + +172 
No. of interests (as stated on application form for 
+ +166 


entry to course) .-+-+-srrrecer res 
+ +165 


Family size 

Accuracy on vocabulary jest a eiqzste uss tetain' nie aeii + -165 

Vee baliiediGyt essea eens ove” aa +155 

Natatalism in education’ “sem ersen eaa A —151 

Paranoia or suspiciousness (16° BEE) scams oor —142 

Social interests (Kade esra +:138 
—:136 


mic values (Allport-Vernon) 


these correlations, although small, make quite good psycho- 
comment. The concept of ‘ natural- 


las In general, 
logical sense and require little explanatory 
1959 ; Oliver and Butcher, 1962) refers approxi- 
A teacher 


ism ’ in education (Butcher, iver, u € 
mately to a ‘ child-centred,’ ‘ progressive grouping of attitudes. 
Scoring high on the scale 0 naturalism would be expected, for instance, 10 


approve and follow many of the precepts of John Dewey. 


Econo: 
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(ii) Prediction from biographical data. 


Twenty-five per cent. of the correlations between biographical particulars 
and the three main criteria are significant as compared with 13 per cent. for 
tests. As with the psychological tests, examination success is easier to predict 
than practical teaching. Academic capacity is important, but there is also a 
relationship with breadth of interests (number of interests and university offices 
held). Two points of interest may be mentioned about these results in respect of 
theory marks. Firstly, the correlation between degree class and theory mark is 
remarkably high, particulaily when one considers the necessarily coarse grading 
in which degree class has to be expressed. From the point of view of predicting 
theory mark (and, indeed, award of the certificate), degree class is much more 


efficient than entry grade, although all available evidence is taken into account 
in arriving at the latter. 


Secondly, this is also true of tutor’s reference. 


award of the certificate are better predicted by the report on a student’s 


ry grade, although the 
ass is already known to the 
all the information available to the 
interviewer, on the results of the interview, and on any further information 
e, such as additional references. 
variables but these essentially 


Putting the test scores and biographical data together, the good classroom 
teacher appears to be essentially stable, sensitive and of good academic capacity. 


Students Successful in examination have the same characteristics, together with 
good verbal capacity and tenderminded and radical attitudes to educational 
problems, 


(è) Multiple Correlations. 


of predictors for practical teaching, the theory exam- 
ination and the final certificate award. Thes iabl i i a 
single test. tai rR oe ese variables are all derived from 


The test of general 

F ung . ecuve than the test of educational 

ere is a tendency, but it is not very marked, for the multiple 

cory examination than for practical teaching. 
An interesting findin 

particulars afforded e Ee ee 


y better icti ‘ 
variables (-639). It should, op action (871) 
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TABLE 3 
MULTIPLE CORRELATIONS. 
Practical Final 


Teaching Certificate 
Grade 


+270 -335 
-438 x 448 
+330 s -268 


8 ability tests 
15 personality variables 


10 interest scores papa in 
p values ag +167 x +298 
2 attitudes ar -084 : 134 
3 educational attitudes..........00+0e eres -207 +257 

42 psychological variables 

21 biographical particulars 


-653 -639 
-501 “871 


LV.—IMPLICATIONS FOR SELECTION PROCEDURES. 


Although considerations of space prevent a full discussion of the several 
factor analyses carried out, it will be convenient simply to list the seventeen 
main factors which emerged in the most satisfactory of the analyses and to 

One 


discuss the most promising predictive measures under these headings. 
factor loaded almost exclusively on criterion variables and was interpreted as 
teaching ability. The remaining sixteen were interpreted as follows : 


(1) Academic success.* 

(2) Verbal ability. 

(3) Non-verbal ability. 

(4). Stability of personality.* 
(5) Extraversion. 

(6) Radicalism. 

(7) I ‘ender-mindedness.* 
(8) Emphasis in expressing opinions. 
( 


t in arts subjects versis science subjects. 


9) Interes 
(10) Aesthetic interests. 


(11) Participation in social activities.* 
(12 Socio-economic status. 
(13) Family size. 
nent.* 


(14) Assessment on entry to Departn 
(15) Verbal fluency. 
(16) General culture.* 


Six of these factors were loaded by variables which appeared to have a 
in criteria, and these have been marked by 


ing relation with the mal 1 a ; 
They are also shown, with the more promising test 


asterisks in the above list. 
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d biographical variables, in Table 4 below, which includes the combine 
editti iects of the groups of variables. The correlations would probably 
be appreciably higher if the technique of multiple correlation had been used in 
place of the simple pooling with unit weights. 


TABLE 4 
TESTS TO BE USED IN SUGGESTED SELECTION PROCEDURE. 
Theory Practical | Certificate 


Marks Teaching Award 


(1) Entry ASSESSMENT. 
Final entry grade 3 š RT +358 +276 +322 
Tutors’ reference (degree course) +366 +249 4-415 


Total (unit weights) +408 4-296 


-+416 


(2) ACADEMIC Success, 
University degree class +569 +238 


Total (unit weights)... -+238 


(3) GENERAL CULTURE. 
General Culture (Butcher and Warburton) 


-005 
Vocabulary (Thurstone) 


+054 


Total (Unit weights) 


-+128 


(4) STABILITY. 
Conscientiousness (16 P.F.G.) +236 
Self-control (16 P.F.—03). -+223 


-281 


-+215 
+210 


277 


(6) SocraL ACTIVITIES. 
No. of expressed interests +2 -++263 
No. of University offices held +108 


+231 


-708 


_In considering these findings, it is also interesting to note the factors to 
which success in the teachin; 


n 8 course is largely unrelated. These are verbal and 
ae Verbal ability, extraversion, radicalism, emphasis in expressing opinions, 
arts versus science interests, aesthetic interests, socio-economic status, family 
size and verbal fluency. 


“S 
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I 7 a ” > . 
irate thi — therefore, appear best in selecting potential teachers to concen- 
Rach on ress ao and general culture rather than ability, on stability rather 
eon paren ersion or introversion, on academic rather than aesthetic interests, 
Somi articipation in social activities rather than on social or domestic back- 
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CRITICAL NOTE 
SPECIFIC DYSLEXIA 


By D. RUSSELL DAVIS* anb ASHER CASHDAN + 
(Medical Psychology Laboratory, University of Cambridge.) 


In areas of psychopathology where aetiologies are but imperfectly understood, 
and where methods of treatment are still a matter of argument, it is not surprising 
that controversy should rage over appropriate groupings of symptoms and the 
terminology most suitable for them. One such case is that of specific developmental 
dyslexia (formerly word-blindness), which under its new name has recently been 
receiving fresh attention (see Zangwill, 1960 ; Vernon, 1962; Miles, 1962). In our 
opinion, the controversy over the existence of this condition is often based on an 


imperfect understanding of the cardinal principles of scientific investigation and 
classification. 


A disquietingly large number of children suffer from serious backwardness in 
reading. Some catch up quickly, others never learn to read well. Some respond to 
specialist remedial teaching ; others show a temporary improvement, but relapse 
as soon as the special teaching is withdrawn (see Collins, 1961); others show no 
positive response at all. Some of the sufferers are girls, but more of them are boys. 
Some are of normal or superior intelligence, far more of them are of low intelligence. 
Many of them have other educational disabilities ; many also have emotional and 
family problems. A few of them have unusual patterns of handedness and eye 
dominance ; a few also are claimed by some investigators to have special perceptual 


difficulties, or difficulty in associating symbols ; such perceptual difficulties may be 
limited to the activity of reading, or may be more general. 


The above list is not exhaustive, but implicit in it is the problem that concerns 
us. This is, to what extent are any, or all, of these factors associated with each 
other. If they are, we need to know which of the associations are causal ones, and 
in which direction the causal relationship operates. For instance, does backwardness 
in reading produce emotional disturbance, or do emotional disturbances lead to a 
refusal to learn to read ? Or are both true ? 


_If we studied and treated each case of reading disability separately, we might 
achieve fair success in curing the disability (or removing the impediment to learning). 
We would not, however, be able to draw 


isa bili any conclusions about the cause of the 
disability or the mode of operation of the ‘ A + EAUS 


a cure , unless we were able to plan and 
carry out the treatment experimentally. 


1 : t | y—and a single case is rarely sufficient for a 
satisfactory experiment. Again, if we gave identical remedial treatment to two 
groups of children, each having distinct features—such as right-handed and left- 
handed children, or children of low and high tested intelligence, or simply boys and 
girls—suppose then that one group improved significantly more than the other, we 
might conclude that left-handed children, say, have more persistent difficulties than 
right-handed ones, or that girls recover more easily than boys. Even in such @ 
situation, alternative hypotheses would be tenable, but these alternatives could 
that one group had done better tha” 
c provided always that the result was & 
In other words, it would be reasonable to classify retarded boys 


separately from retarded irls if the two group i tment. 
But, if the response of tes a groups responded differently to treatme 


© groups was not clearly different from each other, 
no case would so far have been ifyi 
made out for classif T Si ¥ ou 
r assifying them as separate groups O 


another would have justified the classification ( 


* Now at the Department of Me 


ntal Health, Bristol University. 
t Now at the Department of Education, Manchester University. 
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In fa i ; i 
fact, if any category is to be established within the group of backward 
monstrably different in a sig- 


Tea i 
cee a must, to serve any useful purpose, be de: 
purpose a a from other (or some particular group of) backward readers. The 
E ort such a differentiation would be to make easier the task of preventing, 
SETE TE arane ae course of the condition. The study of symptoms, and their 
Pe , is only useful in so far as it throws light hologi i 
Gries ight on psychological mec} 
is somehow relevant to these three tasks. - a Ë AT 
esting then that it must satisfy one 
1 ct of aetiology, prognosis, or 
the difference : ers only in respect of symptoms, then 
LS A pa nces are either trivial or point to a need fo 
ogy, prognosis or treatment. 
Let us now, in this light, examine the proposed category of specific dyslexia, 


and i eo te 
nee ne each criterion in turn, ask whether the members of this proposed category 
e differentiated from other backward readers. 
form of reading backwardness. There 


i5 wine is known of the aetiology of any 
Re Diese certainly some relationship with low intelligence, and with social class, 
en this is not well documented. Hereditary factors have been suggested by 
Hallgren (1950) found 


some i ti i 5 
investigators, but there is no clear-cut evidence. 
this finding is so divergent from those of other 


onlitary factors in almost all cases ; ; 3 e 
iada as to make us hesitant to accept any of his conclusions. Zangwill (1960) 
hand. a significant number of affected relations in the families of dyslexics with 
TA eae problems. But his work is also inconclusive. Such studies raise, 
ie entally, the question of the value of history-taking to establish familial features 
ie Pathological groups. Quite simply, if the mother of a child with some pathology 
aa ited to produce relatives with simi s highly inclined to do this 
child = the same time to alleviate her own guilt The mother of a normal 
value ye no such inclination. An investigation -taking may be of 
ence J however, if it is designed deliberately to show whether the pattern of incid- 

all K affected relatives conforms to the one predicted from a particular hypothesis. 

is gren’s hypothesis, for instance, was that the pathogenic gene is dominant, but 
method of proving that the observed pattern of incidence does not depart sig- 


Nificantly from the predicted pattern was hardly satisfactory. 

has So much for aetiology. We can do no better on prognosis. As Vernon (1962) 
ay Pointed out, we certainly need to know why, in groups of children given remedial 
e aR; some improve while others do not, but in fact we do not yet know why 
“ th happens. Similarly, too, with methods of treatment. Miles (1962) says that 
n iere is broad agreement as to the general type of remedial teaching that is 
Ormally called for,” and he advocates, amongst other things, emphasis on phonic 
k use of all relevant sense-modalities 
but still await 


X 
Sis than on look-and-say methods, and the ‘ eva 
the learning process. We would agree with his prescription, 
icated for any specific group of poor readers. 


evi 
dence that it is indica 


lar histories, she is 
feelings). 
based on history 


We cannot, therefore, agree that there is any good evidence for Vernon (1962)’s 
Assertion that ‘í there mee exe a number of cases who have great difficulty in 
ms not to be attributable to environmental 

reasons why psychologists 

Jated to the 


earni 
iene to read, which difficulty seems, 
and Sorto emotional maladjustment. There are many eso ate 18 
clini Others might wish to spread such a belief. Some o it cane ee 
ab ical management of cases, and others to the desire to suppor a priori 
Out neurology and genetics. But there is not much evidence to support the 
There are itimat n which advances may be made in this field. 
al i mas By clinical lege tigation and by testing theories and hypotheses in the 
eee situation, or even by developing new insights there. But this has to be 
att wed by the second method, which is to examine the relevance of separable 
SS (whether aetiological, symptomatic or prognostic), one time, in specially 
cted groups of children who arè backw: 


e ways i 


ata 
ard in reading. 
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To sum up, it is legitimate to adopt for research purposes the hypothesis that 
there is a special class of backwardness in reading, which we may call specific 
developmental dyslexia, and which has one or more characteristics, and f then to 
carry out experiments to test the hypothesis itself, or, more likely, a deduction from 
it. Whether such a hypothesis is a promising one at the present time is a matter of 
opinion. But it would only be legitimate to assert or imply that the existence of this 
class had been established if evidence could be provided of a quality which deserved 
general acceptance. We have indicated in this note the sort of evidence which would 
be required, and we have given our opinion that it has not yet been provided. 
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SPECIFIC DYSLEXIA 
A RE 7A 
REPLY TO DR. RUSSELL DAVIS AND MR. ASHER CASHDAN 


By M. D. VERNON 


lt was mi g 
esas ae mada SS in my paper, “ Specific Dyslexia ’ that the classification of 
hypothesis alts borers, | backwardness as specific dyslexics ’ was presented as a 
ehaviouy ae MEGS the association together of certain symptoms and types of 
aena WE were reading backwardness with above chance frequency, and in the 
certain other characteristics such as low intelligence and emotional 


Maladjus P 
omen Saen (except in so far as the latter developed after rather than before the 
Hypsitiesicr 4 ing difficulties). There seems to be reliable evidence to support this 
Cashdan de notably that obtained by Dr. Ingram (to whom Dr. Davis and Mr. 
what mig} H aot refer). This hypothesis does no more than suggest tentatively 
suggest pM e the causal factors, which are obviously very obscure. But it is, I 
research scientifically interesting, and valuable in so far as it may stimulate further 
“the ia (which may, of course, disprove it). Whether the hypothesis is useful in 
medie > of preventing, curing or forecasting the course of the condition ” is a 
sug rake question on which I am not qualified to adjudicate ; though Į would at least 
might that it might be desirable to consider whether backward readers who 
be ' specific dyslexics ’ should be treated in a somewhat different manner 


trom 1 
n those whose backwardness appears to be due primarily to environmental 
vslexic ’ may be preferable 


fact A 

eae Incidentally, I think that the term i specific dy 

well at of ‘ specific dyslexia > in so far as the latter suggests the existence of a single 
-defined clinical entity, W 


hich seems improbable. 


rE entirely agree that the clinical diagnosis of cases of reading backwardness is 
much woefully inefficient, and I have said so repeatedly. I have also urged that 
esi more thorough and systematic experimental examination 1S necessary of the 
ate hological processes characterizing backward readers, as well as of their clinical 
Butt I agree also that all too little is known about prognosis and treatment. 
Seams cannot agree that “the second legitimate way of advance must be to 
ubilisgoe the relevance of separable factors one at a time. This method has been 
oben again and again in the past with little useful result. What we need to 
erve now is the nature an ncy of the association between certain 


aeti i d frequer 
iological, symptomatic and prognostic factors in backward readers. When these 
to offer further hypotheses as to causality. 


1 F 
have been established, it may be possible 


rec said here, I should like to quote from a paper in a 
Dee, published book, Reading Disability: Progress and Research Needs tn 
yslexia, edited by John Money, Associate Professor of Medical Psychology and 
cane at Johns Hopkins University (Baltimore, The Johns Hopkins Press, 
me 2): “ It is not at all rare in psychological medicine, nor in other branches of 
medicine, that a disease should have no unique identifiying sign, that uniqueness 
mig in the pattern of signs that appear in contiguity. „Out of context, each sign 
S Sht be encountered in other diseases, OT, in different intensities, in the healthy. 
becific dyslexia is no exception in this respect.” 


In support of what L have 
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A BRIEF COMMENT 


ON 
THE EFFECT OF SOCIO-ECONOMIC STATUS ON THE SCHOLASTIC 
ACHIEVEMENT OF SIXTH-GRADE CHILDREN, by ROBERT L. CURRY* 


By R. R. DALE 
(University College, Swansea) 


ms to have no effect upon the scholastic achievemen 
tellectual ability.” Un- 
Even within his high 
apparent as within his low ability 
superior scores in all cases. The 


d by chance, and to say that it has 
certainly occurred by chance—which is the equivalent of saying that in this case 
socio-economic status has no effect—is statist: 


There is also a second fact which de 
produces a patterned result there is 
the pattern when we are considering 
results there are in effect eighteen d 


e the extreme socio-economic groups, comparing 
upper with lower, there are nine differences in mean mark to be considered, and all 
nine are in the ‘ expected ’ direction 
effect, and although when we co: 


l er in the high intellectual ability groups, we 
might well conclude that “ the trend is continued but lacks statistical significance ; 
such significance might be obtained in future experiments by increasing the size of 
the sample.” We need to bear in mind, also, that although we have by consent 
drawn an artificial line between 


g n ‘ significant ’ and ‘non-significant,’ there is in 
actuality no such barrier, but a gradual transition, 


* This Journal, May, 1962, p. 46. 
tIn Reading, Arithmetic and Language. 
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RESEARCH NOTE 


THE RE 
HE RELATIVE PERSONALITY ADJUSTMENT OF SEVERELY DEAF 
AND PARTIALLY DEAF CHILDREN 


By L. R. BOWYER, A. MARSHALL anp K. WEDDELL. 


ved electronic aids, of placing partially deaf 
jon of their need for special help with 
at a partially deaf person may 
verely deaf. 


stdin plien consequent on impro 
sociai meer ordinary schools, has raised a questi 2 
eee emotional problems. One hypothesis is th 

nore of a personality problem than one who is se 


Bristol between May and September, 1962, to 
s and to see if there was evidence to 


A pilot study was carried out in 
re studied, each with 


y oyt ways of investigating this hypothesi. 
ten ae more time-consuming research. Three groups we v 
The hildren, respectively severely deaf, partially deaf, and of normal hearing. 
(or far range was restricted to 8-10 years. ‘The groups were matched by paired 
Sin ae comparison for age, sex and intelligence. Ranges of LQ., rather than 
levee 1.Q. figures were matched, corresponding to the Stanford-Binet and Wechsler 
els of Superior, Bright, Good Average, Average, Low Average, and Dull or E.S.N. 


For ths 3 ; 
or this purpose a judgment was made based on recorded test scores, teachers 
The severely deaf 


i eng and the results of a Goodenough Draw-a-Man test. 

partal (Grade IIT deafness) were all from one Special School for the Deaf. The 

ordi ially deaf children (Grade II deafness) were either members of a deaf unit in an 
inary primary school or were individuals in classes of children with normal 


hearing. 


THe 
HE CRITERION oF ADJUSTMENT. 
vas one of ‘ adequacy of 


fa _The criterion of adjustment devised for this study V f : 
= ntasy,’ assessed from a Lowenfeld ‘ World ’ and a Lowenfeld Mosaic pattern, with 
sae eran also of qualitative features in the Draw-a-Man test. This criterion was 
i Opted for two reasons. First, the Lowenfeld techniques do not need words either 
oe e instructions or in response, thus equating the communication problem for all 
ps. 
dy of deaf children, (Brill, 1960) ae used w 
ik owed a trend in favour of the severely 
te en hors think that this 


deaf children, but it was not statistically significant. 
a ee have been because they omitted to eae those with deaf parents from those 
be hearing parents. In addition, however, the criterion 0! 
en too simple or one-sided. The present attempt to gain an } r 
i i ‘ective techniques 1S aimed at balancing 
ce of research it would be important 
© use both ideas. In our pil de of the Vinelan 
am and we are grateful to erbert for helping with 
now th Goodenough and Mosaic tests. Attention was also give 
= Wledge of the children’s homes. Only the ‘ adequacy of fantasy ' resu 

Ported here. 


aie Secondly, a recent American stu 
erion of how much each child was 
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THE SCORING SYSTEM. 


The ‘ World’ and the 


Mosaic patterns were scored in the following way, with 
one point for each aspect : 


Sanp Tray 


Mosaics 
Recognisable Adequate Freedom Good Freedom Total 
Theme Content from Gestalt from possible 
(more than 4 adverse adverse Score 
categories in Clinical Clinical 
relationship) signs signs 
+ + + + + 5 


eee 
The validity of the World ‘Cl 
aggressive) had been est: 


inical signs’ (empty, unpeopled, rigid, over-fenced, 
and also for the Mos 


ablished in previous research (Buhler, 1951; Bowyer, 1956) 
aics (empty centre, frame design, etc., Lowen feld, 1952), 


RESULTS., 


The average scores for the sev 
were 4-1, 3-4 and 2-85. With the Sign test of stati 
parametric Statistics, p.68), using the hearing group as a control or norm, the severely 
deaf children in this Study are above average in adjustment (N 10, x=3, p= +05). 


However, the partially deaf children are not below average to a statistically sig- 
nificant degree. 


The results of comparing the tw. 
Sex, age, and intelligence, gives a difference in favour of the sev 
not statistically significant, although it appears notice. 


speak. A colleague has suggested that a more power! 
of variance, might bring out 


erely deaf, normal, and partially deaf children 


stical significance (Siegel, Non- 


o deaf groups, in matched pairs, equated for 


erely deaf which is 
able to the naked eye, so to 


ful statistic, such as analysis 
any significance that may be there. 


It is possible to obtain a significant difference between the two deaf groups 
with the Sign test, if one abandons paired comparison, regarding it as having served 
its purpose in equating the groups. With the scores arranged randomly, p= -002 
can be obtained. By pairing (a) for intelligence regardless of age or (b) for age 
regardless of intelligence, then probabilities of -06 


Severely Deaf 


Partially Deaf 
J.B. 5-4 V.W. 31 
S.B. s+ 7 2 
S.H. 3 — 4 
J.A. of i 
OC s+ 2 
M.H. 5= 5 
R.G. 44 1 
RR. 5+ 2) 
LF. 3 34 
ALR. 34 r 


» Sex, and intelligence. Tiesi 
tinue research on relativ 


O i 
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THE Next STAGE. 

s to look at the playground behaviour in those 
schools with deaf units, to find out how many social contacts take place between 
Partially deaf and hearing children. The other is to proceed with a larger sample 
of children in a school which teaches both severely deaf and partially deaf children. 
For it may be that the partially deaf child is less well adjusted only when ue is 
educated as a member of a minority group in an ordinary school for hearing chi T 
ie., adjustment may be a function of the Special School with its entire stafi, 


buildings, equipment and grounds devoted to helping the handicapped child. The 
alternative ‘hynotienia KA be that even in this favourable environment Me 
partially deaf child by the nature of his handicap, is 2 ‘ marginal man ’ wh 


neither with deaf nor hearing children. 


Two studies are planned. One i 
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A Survey and Discussion of the Use of Reports to Parents in Scottish and 
Other Schools. 


By F. J. R. HONIGMANN 
(Summary of thesis presented in part fulfilment of requirements for the degree of Ed.B. 
in the University of Aberdeen in 1962) 
AIM. 


To review the practices in the issue of reports to parents ; 
practice in Scotland with that i 


States ; and to consider the ide 


to compare the 
n selected parts of England, Wales and the United 
al form of such a report. 


Data. 


Thirty-five education authorities in Scotland, tw 
Wales and six in U.S.A. (total seventy-two) 
reports to parents. Twenty-nine from Scotl 
representative sample for Scotland. 
only) were also representative, In 
seems generally to be left to individ 
English schools but less than half se 


enty-two in England, nine in 
were asked to submit specimens of 
and replied, providing a reasonably 
Replies from Wales and U.S.A. (large cities 
England, the responsibility for issue of reports 
ual schools ; requests were sent to a sample of 
nt specimen reports. 

METHOD. 


Reports from secondary schools were selected fo 
sidered were : frequency of issue; format ; 
given (examination marks, stand 


r examination, Points con- 


nature and quantity of information 
ards of comparison, ratin 


8S, space provided for 
remarks, aspects of performance assessed), 
Resutts. 

Scottish reports tend to be restricted to an Impersonal list of examination marks 
on formal subjects with very little provision for y erbal comments or assessment of 


Conctustons, 


Developments in educational practice have not be i f 
a a en reflect: e form 0 
reports to parents in Scott. E cted in the fo 


p and. Alternative forms which communicate information 
more effectively and aid parent-teacher collaboration are suggested, 


An Evaluation of Readability Formulae in assessing Age-Placement 0f 
Text Books, 


By MARGARET TAYLOR 
(Summary of thesis Presented in $ g 2 ia 
in the University of Aberd i ESS 


To study the validit r 
Chall and Der y of three Americar 


'vereux, as applied = 
books graded by pu PP an to the age-plac 


2 readability formulae, Lorge, Dale- 
blishers as appropriate fo: 


ement of twelve British school reading 
r children aged 9 to 10 years. 
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Metuop, 
The criteri e i 

forlyfve e TD used for evaluating the formulae was the combined judgement of 
Da kewas ane rine teachers. An order of difficulty among the twelve reading 
application of = atea from the teachers’ rankings, and the order obtained from the 
alo fesad eao h of the formulae was compared with this criterion. The teachers 
which we ae tartos which contribute to the age-placement of the books but 
exercises and oe. the formulae, such as format, appropriateness of content, 


Resurts, 
their Ng ratings showed an average intercorrelation of over -+-0-7 and, , 
criterion ane udgment was, therefore, accepted as a reasonably consistent 
tion with ai He a orge and the Dale-Chell formulae showed moderately high correla- 
Bave a corr eee (-+0-7 and +-0-6 on rank-order), while the Devereux formula 
other ie ation of only +05. ‘All three formulae correlated highly with each 
tended to ha am between +0 8 and +0:9). Age placements by the formulae 
tion of +0 3 hat high by comparison with teachers’ opinions. There was a correla- 
of lis Boole: etween the teachers’ order of difficulty and their ranking of the merit 
of other ital hool reading texts. Additional comments from teachers and ratings 
aspects were found to add relevant information beyond that given by the 


formulae, 


ConcLusion, 

the ne value of two of the readability formulae was established, but only within 

eh imits of what these formulae aim to measure. Readability, however, is shown 
e only one of several factors relevant to the appropriate age-placement of these 


School texts. 


A Gaelic Translation of the Wechsler Intelligence Scale for Children. 


By MARY MORRISON 


(Summary of thesis presented in part fulfilment of requirements for the degree of Ed.B.in 
the University of Aberdeen in 1961) 


Alm. 
ith reference to the 


ian test the validity of a n 
y of the effect of bilingualism on int 


ew Gaelic translation of WISC, w 
elligence. 


ril to September 1948 


Susyxcrs, 
130 such children, 128 


. All Gaelic speaki shildr 

(inelusi aelic speaking childr 
ve), age 5 3 

Were sr i d 11:6 to 12:0 


Ap 


en in Lewis, born in 
of 


at the time of testing. 


Tests, 
vaa addition to WISC (Verbal and Performance. Scales), two Moray House 
al Reasoning Tests, Raven’s Progressive Matrices, teachers’ estimates of 
upational status of the homes and of the 


Sch} 
are performance, and indices of the occ 
of Gaelic in the linguistic background of the homes. 


Rersuts, 
(i) a main findings were : 
a Gaelic translation of WIS 
howing an average score of 


C agreed with the Moray House tests and Matrices 


92 or 93 in the group. 
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(ui) Gaelic children were at a disadvantage on performance tests, and on strictly 
timed tests. 


(iii) The form of WISC sub-tests recommended for use with Gaelic-speaking 
children are : 

Verbal—Comprehension, Similarities and Vocabulary (all in Gaelic), 
Arithmetic and Digit Span (both in English). 
Performance—Picture Completion, Block Design, Object Assembly, 

Coding and Mazes (all with Gaelic instructions), 


The Effects of Treatment in a Special Class ina J 


unior School on Retarda- 
tion and Maladjustment with Special Reference to the Concept of Self 


By DAPHNE C. IVES 
(Abstract of thesis submitted for the degree of ME 


d. in the University of Manchester, 
December, 1961) 


AIM OF THE INVESTIGATION. 

This research was undertaken in order to assess the value, to eleven children of 
junior school age, of a year spent in a special class. The class was attached to a 
primary school in a Lancashire cotton town which accommodated 350 children. 
The children had been 


placed in the class for one or more of the following reasons : 
they were borderline cases for placement in a school for E.S.N, children ; or, they 


were of low-average intelligence and severely retarded in one or more of the basic 


subjects ; or, because of some emotional disturbance, they had been unable to adjust 
themselves to a normal classroom situation, 
Special emphasis was put upon changes in the self-concept of these subjects. 


Merrnop. 


aluations made at the beginning and at the end 
of the period included Sander’s tests of Social Under-Evaluation (SUV) and Non- 
Social Tendency (NST),! Raven’s Projection Test,? the Bristol Social-Adjustment 
Guide, forms 1 and 2,* together with tests of persistence and level of aspiration 
designed by the present writer, In addition, a series of controlled observations were 
using a check list during free-play periods, Items which were 
sequently grouped under four separate headings : hostility, social 
‘awal and constructiveness. All of the data so collected were 
atched-Pairs Signed Ranks test* to establish the direc- 

tion and significance of changes over the year. 

(2) Additional information Wi questionnaires which were 
completed at the beginning and en basis of separate interviews 
with the children themselves and with their uardians ; (b) questionnaires 
administered to sixty-six children who attended the school to which the special class 
was attached concerning their attitudes i n in this class ; (c) question- 
naires completed both by the special class children and by a control group taken 
from the remainder of the schoo] regarding their attitudes towards previous teachers ; 
and (d) four sociometric tests i i 


Although none of t 


readi isti : ‘aph lent itself 
readily to Statistical analysis, the information was found valuable in forming a 
qualitative picture of thee : 


a special class of this sort. 
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(3) Educational progress was measur i 

a rogr asured by standardized tests oi attainmeni 

z (iss e a a A year: In spite of what appeared to E 

lend themselves to statistical compari: aae tates E E 
iso; ri k i 

Ned lle te ce ey ae eee re owing to the fact that many of the children 


RESULTS. 

(1) Significant changes were found t 7 í A 
measured by a number a the tests mest. a ee te may ae pe 
(P< -01) as also had non-social tendencies (P< -028). Although the total sectors a 
adverse pointers listed in the Bristol Guides showed no significant change, there was 
a significant reduction in items classified U/W arii con Tene a 
and K (Knavery or psychopathic hostility to the institutions of society and a 
representatives as they impinge on the child), (P< -025 in both cases). The subjects 
displayed a more realistic approach towards future performance, as kovai a 
decrease in the final discrepancy scores calculated from the test of level of A 
T gad Observed incidence of hostility showed a marked decline during the 
ee Pe i period (P<-01) and constructiveness showed a corresponding 

E No significant changes appeared in ‘ conformity ’ as shown by Raven's Projec- 
tion Test,* in ‘ persistence,’ or in the observed frequency of interaction or incidents 
classified as ‘ withdrawn ' during the controlled observations. 

(2) The qualitative results of the experiment may be briefly listed as follows 8 

(a) The responses to the questionnaires given to the subjects indicated that 
they all had a poor self-view in relation to school work at the beginning of the year. 
Ten of the subjects showed an improvement in this respect by the end of the year. 

(b) The results obtained from the initial questionnaire given to a parent or 
guardian of each of eleven subjects, showed thatten of the eleven adults were prepared 
to co-operate with the special class teacher and were grateful that special help was 
being provided for their children, At the end of the year, eight of the parents were 
very satisfied with the social and academic improvements made by their children. 
One (a guardian) was not interviewed again and the other two had certain reserva- 
tions about the class, but were not wholly dissatisfied. 

(c) The responses to a questionnaire given to sixty-six children in other 
classes within the school indicated that some of them regarded the special class 
children as being inferior to themselves. Remarks to this effect were made in the 
playground which could have had an adverse effect on the self-concepts of the sub- 
jects. 

about their attitudes to teachers, the 
among the special class children 
categories than among 
f teachers is 


ects were questioned 


(d) When the subj i 
ments was higher 


incidence of self-referential state 
in both the ‘ Teachers Liked ’ and the ‘ Teachers Disliked ° 
the ‘normal’ children. This indicates that the self-forming influence o: 
greater with extremely unsuccessful children. 
. (e) The sociometric inquiries showed that of three 
isolated within this group at the beginning of the year, om 
integrated by the end of the year. f 
(f) Attainment tests indicated that t ie subiec® bine mae 
sufficient academic progress by the end of the able them to feel that they 
were beginning to acquire the basic skills. 


children who were socially 
e had become satisfactorily 


en of the 
year to en 


CONCLUSIONS. ' A 
i E J J imitations 
It should be emphasised that because of the small sample and the limi 
of the experimental situation, the following conclusions need to be regarded as 
tentative. 
.__, 1.—During the year of the ex 
withdrawn ’ (criterion : ten or Mo 


een designated 


periment the subjects who had b 
) became 


re U/W pointers on the Bristol Guides 
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i lative success, 
igni n be tentatively concluded, therefore, that re ; 
a a E climate, or both, help such recessive children to make a 
peels adequate social-adjustment. 


i i i i in the number of 
ame as a decrease in hostile behaviour and an increase in the. 
es og a eae oe observed in the second half of the year. This suggests 
io t providing opportunities for free interactions to children who have failed to make 
aed social-adjustment is beneficial, in so far as it helps them to function more 
adequately in a group. 


jects tended to make their predic- 
tions on the level of aspiration test more in accordance with reality. As the level of 


of success and failure (a high 


at the decrease 
subjects which 
lowing reasons : (2) most of the subjects 

i for the first time in 


ause of the oppor- 
; (c) significant people in their lives were 
encouraged to provide self-enhancing experie: 


parents was considered to be animportant factor in 
n, both academically and socially, 


In the judgment of the writer it would b 
this to be attached to schools which are orient ards individual progress rather 
than towards competition in the form of examination positions and streaming by 
ability. Competition in this form leads to a certain amount of stereotyping which is 
not conducive to healthy self-evaluation on the part of the children who are un- 
successful within the system. It seems particularly important that children with 
problems of academic failure and social maladjusti 
who are temperament. 


ment should be placed with teachers 
ally able to accept them and deal with them in a constructive 
manner, 


e desirable for such special classes as 
ed tow: 
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The Opinions of Young Workers on their own Education in a Secondary 
odern School. 


By FRANCIS S$, JACKSON 
d for the degree of M.Ed. in the University of Manchester, 
April, 1962.) 


(Abstract of thesis submitte 


ft 


va 
$ 


JP 
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THE SAMPLE. 
This consisted of 142 young men drawn from four c i 
Ee ASE 2 r consecutive one-year - 
N T Hey had left the same secondary modern school at the age of 15 aad oon 
eae o es inquiry, between 20 and 24 years ofage. They represented 83 per 
OLEA A oe s total output of 15-year-old leavers over the four-year period in 
bea a ` included a wide range of occupations ranging from undergraduate to 
A simple statistical check revealed no significant differences in age and 
y-grouping between the 83 per cent. included in the sample and the 17 per cent. 


omitted. 


THE ScHooL, 

ani mip school was a small secondary modern boys’ school serving a sea-side town 

aa a N adjoining semi-rural area, Its organization and size (two form entry, witha 

Fa e fourth form) was fairly stable during the period underreview. Thecurriculum 

Sines ee ated no particularly distinctive features and no unusual teaching methods 
1n use, 


Roe 
PROCEDURE, AND SCOPE OF THE INQUIRY. 

i The majority of the respondents (115 out 
y: the investigator in their own homes, usua 
various reasons, could not be reached in this wa 
covered the same topics as were dealt with in each inter 


of 142) were interviewed personally 
lly in the evening. Those who, for 
y completed a questionnaire which 
view. These were : 


(a) Respondent's present occupation, 

(b) Record of further education (if any). 

(c) Opinions as to the adequacy or otherwise of the school as a preparation for 
employment. 

(4) The respondent's views on the relative v: 

(e) Opinions and ideas on vocational guidance. 

(f) Free comment or criticism on any seemingly r 

(g) An over-all assessment of each subject's satis 
education on a seven-point rating scale. 


alues of various school subjects. 


elevant matters. 
faction with his own secondary 


SumMary oF CONCLUSIONS. 


1.—The majority of the young men int 3 : 
favourably disposed towards their own secondary modern school. There was a slight 
alf of the ability range to be more critical 


tendency for those in the upper h abili Aà 
(constructively) than their fellows. Extreme dissatisfaction was rare. 
association betw: 
dards. A hig! 
ments of furthe 


his four-year sample of ex-pupils were 


een ability group and strength of 
her standard of mathematics was 


2.—There was a distinct 
r education appear to be largely 


demand for higher academic stan 
Particularly requested. The require 
responsible for this. 
3 st 72 t. of the entire sample (over 94 per cent. of the ablest 
(3) Almost 72 per cent. of the enti ple f This had led, in two cases, 


quarter) had engaged in some form of further education. £ : 
to the university, and in many others to diploma and certificate courses of various 


kinds. Such achievements as these, especially in ex-pupils of so small an establish- 
ment, need to be remembered by those who would dismiss the secondary modern 
school as an academic ‘ dead end.’ 

4.—Suggested additions to the curriculum (as it then existed) were metalwork, 
a modern language, and algebra. Requests for metalwork were numerous and 
appeared to bear no relationship to level of general ability. Demands for algebra, 
on the other hand, showed a distinct association with level of ability. 
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5.—Many of the ‘ second best ’ performers felt they had not received their full 


share of attention, particularly in the basic subjects. They felt they should have 
been pushed harder, encouraged, and allowed to try some of the work which was 
evidently considered beyond them. 


6.—Music and history came in for heavy criticism as serving no worth-while 
purposein the modern secondery school. From the less severe critics came suggestions 
tnat music might be made optionaland thata new approach to history might be made 
via civics and current affairs. 


z » but some of the mi t htful 
pointed to the value of a as wea 


_— —— 


BOOK REVIEWS 


Anstey, E. (1961). Staff Reporting and Staff Development. London : Royal 
Institute of Public Administration in collaboration with George Allen and 
Unwin, Ltd., pp. 96, 12s. 6d. 

Applied psychologists who have read publications by Edgar Anste woul 
expect this book to be meaty, concise and Daa helpful. They will a be aie 
appointed. _A quarter of the book is made up of appendices showing exact copies of 
staff reporting forms in current use. In the remaining seventy odd pages Anstey is 
able to compress, in a highly readable form, balanced ‘accounts of the objectives and 
methods of staff assessment, of confidential and open systems of reporting, the pros 
and cons of Promotion Boards and some very worthwhile advice on the design of 
report forms ; he gives us the benefit of his experience in training in reporting and 
describes three kinds of staff appraisal interviews. A section which particularly 
impressed me was the five-page critical account of the forced-choice technique. 
I also liked the examples he gave of the ways in which clever reporting officers could 
subtly twist the reports so as to secure the promotion of people they liked or the 
transfer of people they disliked. Obviously, Edgar Anstey has worked with very 


intelligent people. 

A On the face of it, this book is a valuable one for Establishment Officers in 

County Halls and Ministries but not one for educationists. However, those con- 

cerned with the grading and assessment of children, of students and of teachers 
up and validation in any field . . . all these will find 


and those who work with follow- 
Staff Reporting and Staff Development a very useful handbook. DENIS MCMAHON. 


The Year Book of Education— 
) The University of 
Columbia University, 


Brrepay, G. Z. F., and LAUWERYS, J. A. (1962). 
The Gifted Child. Evans Bros., in association with 
London Institute of Education and Teachers’ College, 


New York, pp. x +541, 63s. 
Readers of this Journal will probably find this volume of the Year Book one of 
the most valuable of the series for it deals with theoretical and practical problems of 


giftedness to which society is urgently demanding answers of psychologists. The 
three major sections are headed Psychological Theories, Detection of Ability and 
Selection for Educational Purposes and Increasing the Pool of Talent. These are 
preceded by an introduction by Sir Cyril Burt which must surely be one of the most 
distinguished pieces of writing in educational psychology in recent years. Here in 
under sixty pages he broadly surveys the whole field and so orientates the reader 
for the detail which follows and provides him with a critical evaluation in advance of 
much of what appears in the rest of the book. j 

theories as to the nature of giftedness and the educa- 
tional practices likely to stem from them. of particular interest are Ruth Strang’s 
introduction, Guilford’s Parameters and oa site f zaoni sane Beige 

yi j reati hinking Abthties. 

Measurement and Development of Creative iy pase Oe iar ee eI athe 
tudies reflecting existing techniques 
which have grown up in the countries of their origin with little oS Sal to he 
psychological knowledge. Most are based upon examinations a eoqueuyy nee e 
tests of academic aptitude. Some appear unsophisticated to British eyes. ere 
are, however, some valuable accounts of researches designed to assess the efficacy of 
procedures, as, for example, in Passow’s survey and Husén’s work in Sweden into 
ey beliefs Husén 


the age at which selection should take place. Contrary to many be 
concludes that early selection of the academically gifted does not benefit either those 


selected or those left behind. ; a” 
The third section concerns attempts of modern technological communitt j 
meet their insatiable demands for people of high talents, and of rapidly developing 
countries to increase their available human resources as quickly as possible. The 


es to 
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i iliar ones of recognition followed by appropriate 
Se ae i eden ete ae to encourage bl els development commen- 
ee individual genetic limitations. There is less than one might have hoped 
SE ais Sesion of the social problems satirised by Michael Young which inevitably 
a ah segregation of the most able becomes more efficient, though Anderson does 
Romek on kem in his chapter on Dilemmas of Talent Centred Educational Programmes. 


bt of the book’s value for it gathers 


ic of giftedness and provides a very 
complete entry into the many aspects of the field. L. B. Biren. 


Creucu, M. F. (1962). Educating Older People. Tavistock Publications, 
pp. 185, 28s. 


This interesting book is an attempt to generalise the experience of the author in 
tutoring students for the diploma in teaching backward children working at the 
Institute of Education, Uni: i 


erable use 


i y light up the general picture of 
banality and do-it-yourself ps i i i i 


s the second 
half of the book. The treatment is very uneven since Dr, Cleugh seems to work at 
two different levels, as a i 


trained teachers (the 1 
case-worker who works 


ective level of analysis of 
ordinary human contact. In the days of my youth I became familiar with the kind 
of content which is now bei i ici 


atest adva: 
logical insight—except that in those d 


nces of psycho- 
ays it was described, perhaps unkindly, as 
“ stair-heed gossip.” I quote : 
“ January 29, 
Tutor: Are you keeping your head above water ? 
Mrs, S.: 


Yes, much better this term. 
Tutor: Good. Take it easy. 


Mrs. S.: It’s only just above water, but it 


; i ) is above, 
relaxed, in spile of brick on 17ih. 


(Seems steadier and more 
E” p. 129. 


1c way with the problems of educating 

older people, under the chapter- i y lecturing, discussion, 
Practical work, examinations, This part of the book might be regarded 

as a handbook for adult tut iti i Stematic. Although it 
c 1 scientific studies which 
adult earner, the: ich one would fault as 


Produce a “ Proper book ” in thi 

mere brochure of “ suggestions 

view that although the tutor is 

Serve a therapeutic purpose, Itis the attempt to beco, 

concerned about this function with which 

However, en so few books i i it 

, Publish dealing with 

iene ad ieee sg must compliment the e foe and the 

a e a nak iheibool will be ince it wi ai í 

value to many different kinds o; a manay Pen 
“ stair-heed gossip.” 


Joun McLetsu. 


a 


i 
f 


r 
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Goron, I. J. (1962). Human Development. Harper, pp. 400, 46s. 


aa = bedlg kane the development of the child from conception to manhood, 
Ricceeel go i alance between the forces of heredity and environment, and 
es Rii ah ines the physical, psychological and sociological aspects. The 
a bg os ue hest describes the first two-thirds of the book is ‘ sound,’ and this 
Deri me ns a with faint praise. Two late chapters, on “ Exceptional 
peek Be _and “The Continuing search for Identity ” are very good. In 

ral, oth effectiveness and interest tend to increase as one reads on. 

The author endeavours to study the developing child as a whole while also 


examini re i à : 
> aem anl Grs forces bearing on him at certain times in his life, and seeks to adopta 
ying theoretical position.’ The development of the self ‘is a transactional 
ial environment.” 


process between the developing organism and its changing soci 
approach of looking at the child’s home, his 
internal ” approach of looking at 


He tries, to combine an “ external ” 
Beer group, his school and his society, with the 
Bey Le peroane Hasse various situational forces, and how he perceives his own body 
all eer ay h.’ The theoretical account which is constructed on this basis is 
este by concise but informative statements of recent research. Though 
Ze re some references to British work, there could profitably have been more. 
FO the book is particularly sound there are a few statements which need 
S _examination—though one indeed has seen them elsewhere. Gordon 
adumbrates that “ The direction of (man’s) behaviour is always, in normal develop- 
ment, towards increased complexity.” The intention here is good, but one wonders 
about the word ‘always.’ When man, for example, reduces a chaos of isolated 
facts into order by classification is he not simplifying and consolidating the position, 
maybe preparing for further complexity but not increasing it for the moment ? 
Similarly, we read “ the increase in complexity and organisation of the person 


continues throughout his life.” Does this statement include old age? Much more 
ry of ‘ equilibrium ’ which Gordon 


important, however, is the widely accepted theo. 

states once again. The human organism “tries to maintain a steady state, i.e., 
it attempts to maintain itself and to balance the forces within itself.” While there 
is a central truth here the words do not describe exactly what is happening, and 
Gordon (and many other writers) contradict themselves when they maintain that the 
organism ‘‘at the same time develops towards a new steady state.” The reviewer's 
position is that the developing child must contain, if healthy, a tendency towards 
greater maturity, towards development, therefore ipso facto it cannot be ina ‘ steady 
state.’ Physiologically, it is obvious that the child contains within himself the 
tendency to change, and this is also true psychologically. This often repeated 


concept of ‘ equilibrium ' needs a much deeper examination and a much more 
thoughtful comment. R. R. DALE. 


nd Adolescence. Pelican Books, pp. 286, 5s. 


There are few books of child psychology, covering development from infancy to 
adolescence, which can be recommended confidently to parents who lack specialist 
knowledge of psychology. Dr. Hadfield has provided such a book by writing for 


“ ordinary intelligent parents of ordinary intelligent children.” He has drawn on his 
experience as a psychiatrist to give ma Idren’s behaviour and 


ny examples of chi 1 
dfficulties, and to make recommendations to parents about the treatment of children. 
These are founded on his belief that the fun 


damental need for normal development is 
a sense of security. It follows tha 


t he gives little emphasis to sexual problems as 
primary causes of neurotic symptoms ; he considers that mental health in childhood 
depends on parental love, on discipline which combines kindness with firmness, and 
on freedom from the inhibiting effects of unnecessary restrictions Or i over-protec- 
tion.’ Some readers may feel that genetic constitution and social environment = 

r, he s 


play an important part but, in so far as Dr. Hadfield considers the latte: 
that a sense of inferiority and competitiveness play minor roles in the development 
of most children. 


HADFIELD, J. A. (1961). Childhood a 
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i iscussi ' problems in bringing up child- 

vi n introductory discussion of parents pro 1 es 

AE ba to four describe the stages of maturation, phases in early deve oP: 
eae and the organisation of personality during childhood. In chapter five emotiona 


development in adolescence is described at some length. More serious neurotic 
disorders are dealt with briefly in chapter six and, in 


a last chapter, several ‘ golden 
thood ’ are suggested. ¥ s 
AE E Pier E of allowing children to live fully through 
each phase of development, ex i 


Ruru BEARD. 


Behavior and Development 
r and Bros., pp. ix+375, 46s. 


This is rather a charming book which one would wi 
(rather than order for the L; 


ar print and 
i ion, nsidering the 
vide variety of topics. ted in fifteen 
ections and sub-divisions. The integrity of approach, 
, impose their own unity, an stem only from being 
3 vith the subject-matter, and having masterly powers of selec- 
tion. The book is 
which referenc 


dand motivating to 
apters. The 
included (e.g., for Ch. 9, “ i i 


ir natural habitat— 

reader who expects to 
ar from 5 to 12 will not find this, except for a 
th Gesell’s accounts) 
hool-age,’ as a whole, 


very condensed “ Summary of age characteristics 
in Ch. 14 (pp. 315-330). Th is i 


y ial or seminar kind, either in 
, adult evening classes, or in universit 

elopment with a social 

chapters is a list of “ ese are though 
exercises not too onerous to be carried out with enjoyment in limited ti There is 
a chapter (13) which gives details and princi or cae 
children’s beh viour. Appendix A and A 
selected Pamphlets, each w 

cence, a look ahead ” with 
ourse 


ere is also a specific 
: “ The understand- 
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A lively feature is the introducti i 
they $ 5 d r on of quoted extracts, sometimes from child. 
ae aie fe Se; eee jokes and riddles (p. 227), or an 11-year-old ating 
chilier bene (P. ): “ There are some teachers who have just gotten tired of 
mere ce use the bothersome things they do, and do not like any of them any 
ahegeiie ae pe do not help you learn very much..." The authors of this book are 
steed SA a his category. “ Being around children,” they say (p. viii) “ may 
nico r goy, but it keeps the heart young.” It is pleasant to be able to 
SERT ore aps of high academic standards written with enthusiasm in a manner 
Da he needs of the less academic and of those students and professional 
rs who would benefit from a moderately short, gay but not superficial text for 


this relatively little studied 5-12 age. L. R. BOWYER 


HER, KARL, and STEWART, W. A. C. (1962). An Introduction to the 
ociology of Education. London: Routledge and Kegan Paul, pp. xvii-+ 


187, 21s. 


ae This work is chiefly of historical interest. 
J si no one would claim that it was the best. Its fifteen chapters, arranged in five 
; ctions, are uneven in value and loosely strung together. It purports in its title and 
ntroduction to constitute an introduction to the sociology ofeducation. It does not. 
No beginner who read this book would, in fact, have been introduced to the major 
pre-occupations, problems and achievements, still less the methods, which character- 
ize educational sociology in the mid-twentieth century. 
One-third of the book is de 


É voted to psychological topics : resumés of the more 
obvious and unsophisticated theories of learning and personality development. 
There is some advance beyond associationist theory : the laws of recency, frequency 
and effect are rehearsed, but attention is given 


to the social context in which. learning 
takes place, to the assumption of roles within social institutions. This is both too 
little and too much. It is doubtful whether the beginner would make much of a 
passing reference to Allport’s theory of * functional autonomy ’; 


for others it is 
really rather old hat. The chapters on so nality development 
draw heavily on G. H. Mead ; they are wri int of the contribu- 
tion which the cross-cultural studies of socia 


made to this 
field of inquiry. 

The first third of the book and the last deal with 
properly sociological. It is interesting, however, that in 
discusses teaching, training, and instruction, the temptation to explore the socio- 
logical implications of these distinctions is firmly resisted. The sociological chapters 
contain much that is platitudinous and trite. An ambitious survey of the ideals of 
the Graeco-Roman, Hebrew, Medieval, Renaissance, and post-Renaissance worlds 
illustrates the point that different ages modify the ideals (for example of ‘ the 
gentleman ’) which they inherit, and that ‘ current attitudes have historical ante- 


cedents.’ 

Nowhere in the book are we introduced to the sociology of education as an 
experimental science concerned with the rigorous empirical testing of hypotheses. 
In the final chapter, it is true, a number of suggestions are made for research : a 
“survey of the existing educational institut i 
relations...’ ; the factors in modern life (work an ke fo 
disintegration of the personality ; the proper recruitment and role of élites if the 
cultural decline of our society is to be halted ; ‘ the balance betwee: 
tions of the home, church, school, factory, and public life towards 
individual character.’ There is little apparent awareness of the formidable tech- 
nical problems involved in setting up inquiries which might yield answers to these 
great questions. This reviewer remains unconvinced that “ the right command of 


primary sources and of the existing literature on the subject already offers answers 
to a great number of them.” F. MusGROVE. 


While it may be the last of Mann- 


cial influences on perso: 
tten with only a bare h 
1 anthropologists have 


matters which are more 
a chapter which usefully 
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i ) j isabili rch Needs in 
7 Edit.) (1962). Reading Disability. Progress and Research 1 ( 
aioe. eines Johns Hopkins Press, pp. x+222, No price given. 


ublished here were submitted at a symposium held in the Johns 
Hopline Wedicat Institutions in 1961 under the auspices of the Association for = 
Aid of Crippled Children. There are thirteen symposiasts drawn from the hi a 
fields of education, psychology and medicine. In itself, this is a reassuring sign : 
medicine and paedagogy have been too slow in coming together in joint studies o 

1 interest. 

ee in the field of severe reading disability w arprii 
dramatic break-through is reported or that Howes writes “ the field is still at a stage 
where we need to find out exactly what dyslexia is as well as why it occurs.” It is 


noteworthy, however, that in spite of the o by such as Torpe and 
Malmquist (who were not at the Confe t is expressed as to the 


developmental dyslexia. It is seen as a dys- 
n language and communication resembling the sequelae of some 


ill not be surprised that no 


brain injuries. 

Most of the familiar topics are de 
the most recent work and there are 
educational side there would seem to 
enology, or to recognition and treatment of the condition, 
we still await answers to the questions : What is the aetiolo: 
an early diagnosis? \ 


What are the best 
answers are unlikely to arise from furthe 
and factorial studies, 


alt with and brought up to d 
some topics which will be u 
be little new to report w. 


ate by reference to 
nfamiliar. On the 
ith regard to phenom- 
As Schiffman indicates 
gy? How can we make 
s? It seems clear that 


» The latter half of 
ith such studies though the Papers are largely statements of 
reporting of experi- 
Birch, in his paper on Dyslexia and the Maturation of Vis 
that dyslexia is not itself a problem c i 
derive from a variety of aetiologies. 
worker to decide at the outset which he wishes to explore ; 
but, having made a decision, this does not mean that he regards o 
unimportant 


. It is the main value of this book that workers w. 
decisions of this kind have place: i pinions side by si 


de so that each 
can be correlated with all other: ard seen a little le 


ss obscurely. 
It follows on and supple- 
ness in Reading bringing 
L. B. BIRCH. 


s and the way forw: 
This is esse: all workers in this field, 
ments M. D. V 


S M ) definitive volume Backward 
the bibliography up to 1961. 


Scorr, W. A., and WERT 


HEIMER, M. (1962). 
Research. New 


York : Wiley, 60s. 


The writers of this book have set out to provide help to those w 
upon psychological research for the first time, They use the word psychological 
broadly and what i l the social sciences, 
ere are sı an anathema. They would aver 


Leis ie embark upon it and that without 


such a p O research is to 
a process ledge acquired by the theoretical study of ced- 
pide al s y Ol research proc 
ures ls ephemeral and je: ets the position in which the new- 


finde ni June. But this view forg 

highly sophisticated re Snores the fact that Psychological research techni e 
eh et “e> ATE Not so easi] i i í lds 

and exist in considerable variety, ees ee ae ee st 

fact that, for many new 4 


a research workers, it j i 
of problems which often Provides orj cere 
Proposed studies, 


Introduction to Psychological 


ho are embarking 
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The need for the guidance which such i i 
be g h such a book can provide would almost certainly 
ee bi who have numbers of higher degree students passing though 
ode ee a y the attempts to provide it have been rather few. Of these, the 
eniT e Sis which has yet appeared. It deals in a co-ordinated manner with 
Hi Pres one would expect, with the processes of planning and design in fair detail, 
be ie gr hniques of instrumentation, verbal, observational and experimental, 
ee Ree P Te and with analysis, interpretation and reporting of results. Through- 
AA manan of experience is clearly revealed and the discussion is informed, up to 
see and most clearly laid out. Yet, even so, if the purpose of the book was to 
m and guide the beginner it might still have been better. It tends, as do all 


books i i i i 
ks in this field, towards a discussion of what research is about, as if it were a 


discipline in its own right, rather than concerning itself with how to do research, 


wee it is ever possible to describe this. This is, of course, a simplification and 
nae y R which in some detail is unfair to the authors. To several of their 
£ p ers t ay have added supplements which deal with the ‘ how ’ rather than the 
ins y ang these chapters, and not only their supplements, are among the best in 

ook. Yet, too much of the 400 pages of text is taken up with discussion 


hidh is excellent for the relatively sophisticated but which is inappropriate to the 
person seeking fairly specific guidance during his novitiate. One suspects at times 
d yet informed guide book to the 


that the writers wanted to present a straightforwar 
his way by the fear 


neophyte but were deterred from writing the whole book in t 
of being thought unscholarly. If this were so, it was an unnecessary fear both from 


the viewpoint of the subject matter and from the standing of the authors themselves. 
Had they written more specifically a guide book in the way indicated, they would 
surely have found space to deal, inter alia, with questionnaires, to which so many 
beginners are attracted, with a treatment of non-parametric techniques in sufficient 
detail to give the newcomer some orientation and, surprising omission, with factor 
analysis in outline and with the types of study amenable to it. If space was short 
they could well have omitted the twenty-five pages of standard tables. 

But these observations, notwithdstanding, the book remains the best of its type 
and should certainly be in all libraries, in the hands of those sophisticated enough to 
skip some parts and to benefit from others and with anybody else who is not deterred 
by the rather high price ticket which American books seem to attract in this country. 

G. E. R. BURROUGHS. 


VENESS, THELMA (1962). School Leavers. London : Methuen and Co., Ltd., 
pp. xxv-+252, 25s. 
This book is concerned with young people’s outlook, chiefly on their own 
personal future. What do they expect from life in its two main spheres, family and 
work? In particular, and this is the problem which first stimulated the inquiry, 
what are their ambitions ? 
W The study was carried out with 1,300 boy 
Vest of England, and the other in the vicinit 
technical and modern schools, with ages rangin; 
on leaving age. The methods employed were, 
of the ' grape shot’ variety : questionnaires, interv] 
on. The underlying assumption, shared by the reviewer, is { 3 
research a multitude of relatively coarse approaches is more likely to yield a rounded 
Picture than a single approach, however thorough. The key technique on which the 
author relied most heavily—though without omitting frequent checks from different 


Sources—is the ‘ restrospective autobiography.’ The writer was asked to imagine 
himself at the end of his life, and revi s technique, 


a ew it from that perspective. Thi 
which was previously used by Stoetzel in his study of Japanese youth, produced a 
rich harvest of fascinating material. 


in two counties, one in the 
n. They were in grammar, 
g from 13-plus to 17-plus, depending 
to borrow a phrase from Wilkins, 
jews, paired comparisons and so 
is that in certain kinds of 


ys and girls 
y of Londo 
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Some aspects of ideas about marriage and the family may be cited by way of 
illustration. Current trends are reflected in the favourite age for marrying, between 
20 and 25. Although it is not directly stated, one might infer from the quotations a 
remarkable absence of the romantic notion of marriage (remarkable because the 
prevalent stress in the mass media) : “ This job I stayed at until I got married at 
the age of 20. I had a good husband and we never quarrelled.” Itis mostly rather 
matter-of-fact. This also applied to the references to children ; there was little to 
support the view that frustrated aspirations tend to be projected on to fantasy 
children, and scant emphasis on education as a goal. Very few mentioned separation 
or divorce, but a curious feature was the anticipated high mortality rate of spouses. 

Attitudes about work, knowledge of jobs and job choices are extensively dealt 
with, and this section of the book wi.ll be found most useful by those concerned with 
the practical problems of school leavers. The author introduced an interesting 
analytical categorization of job-choice orientations in terms of ‘ tradition-,’ ‘ inner- ' 
or ‘ other-directed,’ which deserves further development. 

The findings concerned with ambitions have aroused most widespread interest 
and led to the vociferous airing of prejudices. A common reaction to the book was 
that it reveals the youth of today as dull and unenterprising ; had the results been 
in the opposite direction, the line would no doubt have been that modern youth is 
aiming too high, with inevitable later frustration. Here it will only be noted that the 
statistical analysis of ‘ ambitiousness ’ items has as its outcome a scale on which 
technical school boys rated highest. The problem is of course,, a difficult one, and 
the author herself did have some doubts on this score. It is, perhaps, questionable 
whether any single scale can satisfactorily measure ‘ ambitiousness ’ across the whole 
range from unskilled to higher professional work. Another factor, which should be 
kept in mind with regard to all the comparisons made between schools, is that a 
majority of the grammar school pupils investigated were somewhat older than the 
rest. 

These comments are only possible because extensive and detailed tables are 
provided. Readers tempted by the lively style to skip perusal of the tables should 
resist the temptation—they contain much of the meat. Altogether the book has a 
great deal to offer to everybody concerned with young people, but perhaps most of 


all to educationists (a sample of their products) and to youth employment officers 
(a warning that what they are apt to regard as raw material is already partly 
processed.), 


G. JAHODA. 


Vycorsky, L. S. (1962). Thought and Language. M.I.T. Press and Wyley. 


_ This is an English translation of a book written getting on for thirty years ago 
which appeared again in Russian only as recently as 1956. It shows remarkable 
insights into the nature of thinking, and, in its essentially functionalist approach, is 
an appropriate work to put before the English reader to 


day. 
After presenting his problem, which is the growth of thought as shown in the 
construction of concepts, Vygotsky then discusses Piaget's and Stern’s views of 
language development. The comment upon Piaget is, of course, based only on his 


earlier works and the publishers include with the book a separate paper of some 


thirteen pages, setting out comments made by Piaget in 1962. Vygotsky is critical 
of the extent to which Piaget's theory leans on the concept of maturation. However, 
Vygotsky's own contributions form the 


Hl really important parts of the book, particu- 
larly as set out in Chapters 5,6 and 7. In Chapter 5 he describes how conceptual 
grouping by children grows from heaps having no objective basis, through com- 
plexes with no logical basis but possessing some of the qualities of concepts, finally 
to true concepts. In the next chapter, on the development of scientific concepts, Be 
has much to say to the educationalists, in particular contrasting spontaneous with 
non-spontaneous concepts. The latter he exemplifies from school work in science 
and modern language learning, and shows how the two types of concept tend to 
come together in the course of learning. Finally, there is a thoughtful discussion 


— A 
= SS Oe 
oe | 
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of thought a ji i i 
Oache at aa we i heer the growl of speech and thinking are considered 
menk s views on the socialisi i i 

< a checked and criticised by e E Sprea E as 
thinking } this pook £ a ' must ' for workers in the field of the growth of children’s 
EA rena AS pera Those who know the publication by Piaget and Inhelder, 
pidhi resented ade logiques elementaires, will also recognise how similar is the 
D ike e e SSA Heap 4 of the growth of concept formation to that set out 
Psychological beni ta their Sat re The book is, however, more than a 
DSR en oh ofi po ers a method of education, and eos be 

i z. A. PEEL. 


WISEMAN zdi nais 
SEMAN, S. (Edit. 1961). Examinations and English Education. Manchester : 


Manchester University Press, pp- Xx+ 188, 21s. net. 


In 1959, t iec 
for its EE ah chosen by the School of Education in Manchester University 
told, several R ue ectures was the problem of examination, There were, we are 
Schools in: eza asons for this choice—the growing interest of secondary modern 
taking ae Sige of.an external type, the feeling that the time was ripe for 
Secondas oS ere changes in examinatious which affect either primary or 
ing and oh ools, and finally, the recent developments in the techniques of examin- 
The pre e need for relating them to the purposes the examination seeks to fulfil. 
fect at volume, edited by Professor Wiseman, is an expanded version of the 
dealin s delivered on these topics, with an additional chapter from Professor Oliver 
nemo especially with the recommendations of the Crowther Report. 
tions? = a short introduction, the series opens with a ‘ historian’s view of examina- 
y Mr, Norman Morris. This is followed by lectures on examinations in 


relati 

aon o tas grammar school, the modern school, and the primary school, respec- 

Mr. Lu y r.Petch (Secretary to the North Universities Joint Matriculation Board), 

S (formerly Secretary to the Associated Examination Board), and Dr. 

efficiens n. Dr. Wiseman also summarizes the methods and results of assessing the 

educ cy of examinations ; and Professor Oliver discusses ‘ examining genera 
ation,’ and then winds up the course with a paper on the problems of education 


and selection. 
eae five contributors are conscious that both selection and examinations have 
if the imitations and their drawbacks, and therefore, call not only for some defence 
to b y are to be retained, but also for certain improvements if current criticisms are 
emet. They are, however, equally convinced that in recent years criticism has 
the objections to 


pon pressed too far. At bottom, as Dr. Wiseman observes, 
ection, and particularly to selections by means of intelligence tests, are “ social 
” But the familiar 


and political in origin rather than educational and psychological. 
ed by sociological writers, like Dr. Floud and 


pos have been strongly support 
egy & In their book on Social Class and Educational Opportunt 
isti at measured intelligence is well known to be largely an acquired c 
Pa ic,” and therefore unfair to children of the “ unprivileged classes. m 
rahe Dr. Wiseman (quoting Dr. Warburton), is rather like declaring that “ the 
ena Channel is well known to be wider than the ‘Atlantic Ocean. , No one, he 
ds, who has really studied recent research on the subject “ can possibly deny the 
i + and, as actual records 
elped to do fuller justice to 


ay substantial genetic component in tested intelligence hs: 

thaw, the introdnueiian i subh tests has undeniably, helpe oji 

Gary who are socially or culturally handicapped. The intelligence test must be 

Sike if we are to get anywhere near equality of opportunity ; and we would be more 
ccessful in reaching this objective if the test had greater weight.” 


50 netoen Oliver defends much the same point of view. From Plato onwards, 
e reminds us, philosop 1 that educational systems have a 


dual fı x hers have maintained thé r i 
Overs unction—to train and to select; and there is the inevitable temptation to 
aie emphasize one function to the neglect or impairment of the other. Current 
ussion, too, has tended to forget that efficient selection implies and facilitates 
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social mobility. Accordingly, in the interest of the community as well as of the 
individual, it is to be hoped (he says) that ill-informed attacks on examinations and 
selection tests will be stoutly resisted. The Crowther Report “ tries to escape the 
necessity of selection by reviving the supposed distinction between selection and 
qualification.” The G.C.E. has come to serve both purposes; but the Report 
proposes to revert to a separate examination for each. Professor Oliver holds that 
the distinction is unrealistic : if the number qualified exceeds the number of places, 
we must select; and we ought, so far as we can, to select the best. Hence, he 
argues, the Crowther Committee “ would have done better to address itself to the 


improvement of the G.C.E.—the only public examination at this level which has 
achieved national currency.” 


How such improvements might be achieved is admirably outlined by Dr. 
Wiseman. Hitherto, he believes, “ educational psychologists have tended to ignore 
the wider educational implications of their work, while the teacher-examiners of the 
traditional boards do just the reverse ” - they overlook the need and the possibility 
of increasing the sheer efficiency of their techniques. In the United States, ‘' com- 
parable boards have research sections engaged full-time on such problems ” : and 
he rightly urges that there is pressing need for similar studies over here. He sums 
up the whole book by saying that “good examinations are both useful and desirable—-- 
without them education would be much less effective than itis: let us, therefore, 
make sure that the examinations we do use are efficient in the fullest sense.” 


CYRIL Burr. 


WILson, Harriet (1962), 


Delinquency and Child Neglect. George Allen and 
Unwin, pp. 195, 25s. 


There have been numerous studies of the relationship between family back- 
ground and juvenile delinquency, but fey 


w of these have been studies in depth of the 
‘ problem family ’ in which the children suffer some measure of neglect. Dr, Harriet 
Wilson’s thoughtful contribution to our knowledge in this field is, therefore, to be 
welcomed and merits careful study. Basing her book on an enquiry into a group of 
problem families in an industrial city, she sets out to define what is meant by the 
“problem family’ and to discover to what extent such families are the breeding 
ground of delinquency. 
After discussing recent trends in studies of juvenile delinquency (she surprisingly 
omits reference to some important work of relevance, for example that by Stott, in 
this country, and Hewitt and Jenkins in the United States) and making some 


interesting comparisons between British and American culture-patterns, Dr. Wilson 
goes on to describe the patterns 


of living of fifty-two problem families, selected as 
her sample on the basis of performance-inadequacy, under the headings of solvency, 
health and education. She finds that the most disabling feature of the problem 
family is its social isolation, while emphasizing that these families are not, as is 
commonly assumed, a homogeneous entity. Delinquency is part of the pattern of 
juvenile behaviour among families exhibiting signs of child neglect, the main factors 
dominating the children’s lives being poverty and weak relationships with their 
parents and other adults. The chief characteristic of the juvenile delinquent from 


the problem family is his complete unawareness of the moral significance of his 
actions. 


The book is refreshingly free from sociological jargon, and tries to avoid un- 
warranted generalisations. Dr. Wilson has kept an open mind throughout her 
i i , careful about methodological considerations, though 
the children have been studied less systematically than the parents. For example, 
t the level of the children’s intelligence are drawn from 
> and, being partly based on group tests given at the 
infant school stage, hardly reliable ; and the author's reason for obtaining teachers 

i nstandardized form seems inadequate. Further- 
more, can one justifiably infer the 


A presence or absence of maladjustment from the 
correlation between 1.Q_, attendance, and educational attainment (p. 73) ? 


ee ee 


. 
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A slight re-a wes z z 
ooudusiees week of the material might have made the main findings and 
are a necan] — clearly from the mass of detail; and some of the tables 
Dr. Nilson saa obscure. There is, however, likely to be little disagreement with 
where shë spe gh and with her practical proposals in the final chapter, 
Rr a oc s5 E preventive action, measures which would help the father 
better off vou paar family (so that he would no longer be financially little 
dependent on ring pin if unemployed) as well as aid those families permanently 
ingkéase it Ra sis — from the state; realistic family planning methods ; an 
of specialized number of nursery schools ; and the provision by local authorities 

alized supportive help of a personal kind. MAURICE CHAZAN. 


YATES, AUBREY j 
per et Ja (1902). Frustration and Conflict. London : Methuen, 
With eG that this book is not intended to be “ an original contribution 
Imnowledwe: 4 apon but rather a systematic examination of the accumulated 
extensively ae ae Brees of frustration and conflict.” These topics have been 
tenet Hes os q by psychologists, both clinically and, in more recent years, 
Buds EER oH ' r. Yates is concerned primarily with work of the latter kind, 
he hawt ce isefu bibliography of well over three-hundred items shows how widely 
T nevertheless, cast his net. 
very tho book comprises a brief introduction and five main chapters. The first is a 
on frust rough review and critical analysis of the work of Maier and his colleagues 
Maier’s acon and fixation, The author frequently comments on the neglect of 
lastis s findings by other psychologists and remarks : “ There is no doubt that the 
vieio” value of his work has been obscured by sheer prejudice and a priori con- 
lone S; He, nevertheless, admits that Maier's results and theoretical interpreta- 
s are still difficult to reconcile with other contemporary approaches. The second 


cha . P $ $ 4 k > 
Doles provides a detailed examination of the frustration-aggression hypothesis of 
ard, Miller, eż al. and of the very considerable number of experiments which this 

k in this field has proved “ extremely 


has pri r 
as provoked. The author’s verdict is that wor 
of studies on frustration and regression. It 


(eel Next comes an account ¢ 

ein an exhaustive critique of the famous experiment by Barker, Dembo and 
repeat t Dr. Yates notes how extraordinarily few attempts have been made to 
cond, 4 their findings, a need which is surely pressing, since the original enquiry was 
ung ered without a control group and the statistical treatment of data was often 
ack isfactory. The fourth chapter is on confict. Short sections are devoted to the 
pea of Luria, Lewin and Cattell, but the main analysis is concentrated on the 
digs nsive work of Miller and his colleagues. Some most ingenious experiments are 
pe in this section, including a particularly brilliant zour de force, by Murray and 
Peers Anal chapter is entitled “ Frustration, Confl 
patys Here Dr. Yates tries to draw the strands together, and at once the 
t fy lence of Hullian learning theory 1S apparent. Recent efforts to incorporate 
ie ustration ’ into learning theory as an intervening variable are described, as are 
Many experiments relating to these attempts. Eysenck’s theory of personality, 


elt of course, very considerably influenced by Hullian theory, is also favoured, 
and hypotheses are put forward | conflict with neuroticism 


and inking frustration and 
introversion-extraversion. 
ab As a whole, this book is to be highly recommended. Whatever one may feel 
out the author’s theoretical leanings, serious attempts to test rigorously theories 
and should be encouraged. 


al z f s 
bout frustration and conflict are of very recent origin 


r. Yates’ foremost contribution has been to provide a careful, objective and wide- 
d often exciting, current research. For the 


rangi è É 
anging digest of this interesting, an 1 
‘led consideration of the effects of punish- 


ed ; ` a 
ucationalist, a useful feature 1S his detail J 
reatly under different conditions. More generally, 

g serious 


ment ? 
t ‘alee are shown to vary 8re ; 
rese: ook provides an essential basic reference both for those contemplatin c 
arch into frustration and conflict and for all advanced students of these topics. 
G. W. PILKINGTON. 
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THE REDUCTION OF SOCIOECONOMIC BIAS IN 
INTELLIGENCE TESTING 


By R. S. MACARTHUR anv W. B. ELLEY 
(University of Alberta) 


SUMMARY, This study was designed to investigate the extent of socio- 
economic bias in selected intelligence tests, with a view to identifying and 
constructing instruments which might better estimate the intellectual potential 
of pupils from deprived cultural and educational backgrounds, For such 
pupils the conventional intelligence test is unsuitable as a predictor of later 
educational progress if adaptive treatment reduces the effects of their educa- 
tional handicaps. : 

Nine promising culture-reduced tests and subtests were administered to 
a representative Edmonton sample of 271 twelve and thirteen-year-old children, 
and the results analysed in relation to measures of socio-economic status, 


verbal intelligence, achievement and school marks. City-wide testing surveys 
conducted during the previous four years provided data on the same children, 
The results supported the hypo- 


making longitudinal comparisons possible. 

thesis that it is possible to measure a broad component of intellectual ability 
with significantly less cultural bias than is found in the conventional intelligence 
test. Raven's Progressive Matrices proved the most useful test in the battery 
since it showed high g loading, consistent and minimal relation with socio- 
economic status, no evidence of cultural bias by items, and moderate correla- 
tions with school marks. The Cattell test of g and the Lorge-Thorndike 
Figure Analogies subtest showed similar promise, since they met most of the 


criteria of the study in shorter testing time. 


J.— INTRODUCTION 

INTELLIGENCE tests are frequently evaluated by mechanical investigation of 
their predictive validity in relation to academic criteria. Usually underlying 
Such an empirical approach is the assumption that only one or two pre-arranged 
and rather standard treatments will occur between test and criterion judgments. 
Cronbach and Gleser (1957) have drawn attention to a distinction between 
fixed-treatment and adaptive-treatment situations in decision-making based 
on the results of mental tests. In the former case, treatment is specified a priori, 
and maximum utility is related to predictive validity coefficients ; individuals 
are selected or placed simply in accordance with their promise 1n relation to a 
Predetermined criterion with predetermined treatments, and little regard for 
the qualities underlying the test is necessary. In educational prediction, the 
Intervening treatments are usually limited by a relatively fixed curriculum and 


relatively standard teaching techniques. f 
vever, atypical individuals may be 


_ In adaptive teaching decisions, howevé d 
assigned quite different treatments according to their present status and 
mdicated potential. In an educational setting, immigrant children may be 
§lven supplementary English coaching, children from deprived backgrounds 
may benefit from remedial reading, and sensory-handicapped children may be 
offered special aids. Educational philosophies which emphasize the develop- 
ment of potential to its maximum require serious consideration of adaptive 
rather than fixed treatments, and to aid in such adaptive treatment, the use of 
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mental tests which can provide estimates of potential ability, divorced as much 
as possible from inequalities in cultural background. The conventional verbal 
test of intelligence, designed as it is for subjects raised within a specific cultural 
range, has limited suitability for identifying potentially gifted children from 
deprived or foreign backgrounds, or with restricted formal education. The 
writers of this article suggest that various non-verbal tests for use in adaptive 
treatment settings have received insufficient research attention. 


It is further suggested that the choosing of a test to help identify potential 
for attainment of distant and general criteria, assuming the possibility of 
intervening treatments appropriately adapted to realize that potential, can be 
facilitated through the ‘ construct validity ’ approach advocated by Cronbach 
and Meehl (1955). The evaluation of tests of educability should place more 
emphasis than is customary on theoretical analysis of their relation to the 
concept of intelligence as developed by experience, with proportionately less 
emphasis on their ‘ cold-blooded ’ predictive validity under limited fixed treat- 
ments. If potentially suitable tests can be examined in the light of hypotheses 
generated from an adequate theory of intelligence, some progress may be 
anticipated in the development of culture-reduced tests suitable for classifying 
and selecting children from diverse cultural backgrounds. 


It was the purpose of the present investigation to identify tests or subtests 
which, for a sample of 12 and 13-year-old pupils from a variety of socioeconomic 
backgrounds in Edmonton, Canada, could on the one hand, be regarded as 
valid measures of intellectual ability, while on the other hand, could demonstrate 
a minimum of socioeconomic bias. Parallel studies with children from different 
ethnic and language backgrounds are now in progress (MacArthur, 1962). 

II.—PREVIOUS RESEARCH. 

Binet himself realized that his measuring scales were sensitive to ‘ scholastic 

and extra-scholastic acquisitions’ as well as ‘intelligence pure and simple ’ 
(1916). Further analyses by Stern (1914) and Burt (1921) showed significant 
social class differences, particularly in the more verbal items of the Binet scales, 
while Gordon (1923), Wheeler (1942), Baldwin, ef al (1930) and others have 
shown that the conventional intelligence test was unsuitable for children without 
‘formal education, from isolated communities or from rural districts in general. 

Amongst the causal interpretations made of social class differences in 
measured intelligence major emphasis has usually been placed on hereditary and 
environmental influences. While not denying these, it is considered that the 
importance of test bias requires further study. Eells (1951) has produced strong 
evidence to support the contention that bias in intelligence test items is common, 
and that it tends to favour upper-middle class children, From his Chicago 
surveys, he concluded that verbal tests and items show significantly more 
variation in discriminating between social classes than do non-verbal. 


Havighurst, e¢ al (1944, 1947) also demonstrated that the extent of differ- 
entiation amongst social classes depends on the trait measured, the greatest 
discrepancies appearing in verbal and numerical tests, the smallest in spatial, 
mechanical and other non-verbal tests. 

These research studies are, unfortunately, open to the criticism made by 
Lorge (1952) that analysis in terms of social levels reveals nothing about the 
ability of tests to differentiate between intellectual levels. Both dimensions are 
important in test evaluation. Moreover, Eells used only one non-verbal test 


whose reliability was suspect (Tyler, 1953), while Havighurst’s studies were 
based on relatively small samples. 
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Bama (lees ‘ee eee as fare of Cattell ( 1940, 1958) and the Davis-Eells 
Ui studis decon Lo a lequately ulfilled thei: designers’ intentions according 
Stilwell NSE ribed by Anastasi and Cordova (1953), Drake (1955), Smith (1956) 
Aen A ) and Love and Beach (1957), although the evidence has sometimes 
Further He si phe (Marquant and Bailey 1955, Cattell 1958, Fowler 1955). 
ete evaluation of such tests using a construct validity approach in a variety 
R Sine seems necessary in order to assess the usefulness of such measures. 
ica the present investigators are seeking to identify tests suitable for a 
Spor y k cultural and educational backgrounds, it was considered that Cattell’s 
a h to the problem of reducing cultural differentials by holding them to a 
aa aun would be more suitable for this study than the Davis-Eells approach 
Ee nema) them constant. On such an assumption, this investigation was 
a to study tests similar to Cattell’s in an attempt to identify test items 
= 2 tap fundamental processes of intellectual ability while holding environ- 
ental differences to a minimum. 
foun illusions are held about the possibility of eliminating cultural influences 
cents a pencil-and-paper test. Every culture stimulates the development of 
ae ain abilities at the expense of others, and the idea of aa intelligence test 
a ng culture-free in the sense that it measures the products of heredity quite 
part from the impact of cultural experience cannot be taken seriously. The 
Present study aims rather to identify and evaluate tests which reduce the 
dependence on acquired knowledge in order to make possible the improvement 
of mental testing for children whose backgrounds do not fall within the required 


Tange assumed by the conventional verbal test. 


III.—THE CONCEPT OF INTELLIGENCE. 
Over the past several decades the concept of intelligence as general innate 
ability has been subjected to a barrage of criticisms. Factor analysts such as 
Thurstone (1938) and Guilford (1956) have disparaged the idea of a general 
actor; environmentalists such as Halsey (1959) and the Iowa investigators 
Stoddard and Wellman, 1940) have questioned the notion that intelligence is 
Predetermined or fixed; developmental research pioneered by Piaget (1947) 
Suggests that intelligence is intimately bound up with the continuous interaction 
between organism and environment, while the possibility of critical periods and 
irreversible influences in mental development is now seriously held (Hebb, 1958). 

. „One beneficial effect of these criticisms has been to draw attention to the 
distinctions drawn by Hebb (1949) and Vernon (1955) amongst Intelligences 

»BandC. Nevertheless, behavioral definitions of intelligence are as contro- 
Versial as ever, and the operational approach of defining intelligence in statistical 
terms appears the more profitable course ofaction. Such a position was adopted 


in forming the conceptual framework of this study. 
Vy, The Trerarehical. conception of intelligence described by Burt (1949) m 
ernon (1950) is regarded by the present authors as the most useful model for 
assessing intelligence in diverse cultural settings. The general factor is a 
Crucial construct for assessing the extent to which culture-reduced tests are 
Sampling the common denominator of abilities required for success in a variety 
of mental tests. It looms so laige in psychometric surveys of heterogeneous 
children that it is difficult to ignore, while the advantage of distributing its 
Variance over a number of lower-level factors has yet to be demonstrated even 
With culturally homogeneous children. In predicting to distant and general 
Criteria, when adaptive treatments are possible, ‘ pure’ factor tests, are seldom 
Superior to general intelligence tests, while the amount of time and money 
Consumed on the former is an obvious administrative disadvantage. 
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The question of the innateness of the general factor can be circumvented 
by insisting only on its comparative stability in relation to lower-order factors. 
Evidence to support such stability comes from various sources. Woodrow 
(1938) demonstrated that the general factor was relatively unaffected by differ- 
ential practice in a thirty-three variable analysis, while group factors changed 
considerably. Hofstaetter’s (1954) factor analysis of the longitudinal mental 
development of children in the Berkeley Growth Study revealed a stable general 
factor increasing in size between the ages of 4 and 8, and remaining steady 
throughout pre-adolescence. Tests with high g loadings and negligible group 
factors have shown less susceptibility to cultural change (Cattell, 1958) and 
other environmental influences (Nisbet 1953, Burnett 1958). 


Theoretically, the stable g factor is compatible with the hypotheses of 
Hebb (1949) and Ferguson (1956) which suggest that abilities are developed 
primarily by positive transfer from the old learning to the new. Early superior- 
ity in processing and integrating skills should lead to increasingly broad and 
cumulative superiority of intellectual development in a variety of fields. 
Vernon (1960), too, sees support for the general construct in the developmental 
approach for “ those who, early in life acquire a larger stock of perceptual 
schemata and verbal habits are better able to build up more complex and more 
flexible schemata necessary for conceptual thinking ” (p. 41). So the general all- 
inclusive logical structures accumulate and proliferate, producing a broad general 
factor which, although difficult to define behaviourally, continues to account for 
most of the statistical variance in any battery of ability tests. 

In a comprehensive review of theories and research on intelligence, Hunt 
(1961) draws attention to the compatibility of apparently diverse interpretations 
of intelligence, which seem to be best explicable in hierarchical terms. 


“ A hierarchical factor structure, with Spearman’s g at the apex, as the 
English factor analysts have tended to conceive of intelligence is precisely what 
would be expected from considering abilities as the accumulative effects of the 
transfer of training or learning in varied situations . . . Piaget’s conception of 
accommodation corresponds to the change in function that occurs with training 


or learning, and Piaget’s conception of assimilation corresponds to the notion of 
transfer of training ” (Hunt, 1961, p. 359) 


This general factor at the top of the hierarchical model, is essential to the 
present study. To qualify as a measure of an all-embracing intellectual ability, 
while relatively free from the vagaries of environmental circumstances, a test 
must show a substantial loading on the general factor at the top of the hierarchy. 
Culture reduction is of little value if the test does not sample extensively that 
common denominator or ‘ transfer basis’ of the fluid set of abilities we under- 
stand by intelligence. 


IV.—EXPERIMENTAL PROCEDURE, 


Nine tests and subtests were selected as promising culture-reduced measures 
of general intelligence. In May, 1960, these were given to 271 pupils constituting 
a stratified sample from the Grade VII population in the public schools of 
Edmonton, a rapidly expanding central Alberta eommunity of nearly 300,000 
people. The variables of provincial examination averages, socioeconome 
status, geographic location, school type and residential mobility were con- 
sidered in selecting the sample, and a comparison with a random selection from 


the same population in terms of age, sex, occupational status and Laycock 1.0.» 
showed no significant differences, 
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The potentially useful culture-reduced tests selected were : 


3 ao Standard Progressive Matrices (1956). 
3) a tell Test of g “ Culture Free.” Scale 2. (1958). 
(4) ees horndike Non-Verbal Intelligence Test. Level 4 (1957). 
5) 1 one Thorndike Figure Classification Subtest. 
(3) pn tone Number Series Subtest. 
(7) Hoe Thorndike Figure Analogies Subtest. 
(8) eet coe Uni-Factor Test (1952) Figure Changes. 
(9) ara Towar Uni-Factor Test (1952) Mixed Series. 
‘alifornia Test of Mental Maturity (CTMM) Non-Language, Short Form, 
Elementary (1957). i i 
(10) oe variables for which data were gathered were : 
(iy Califonia Test of Mental Maturity, Language (1957). 
(12) Polanen Reawian Uni-Factor Spatial Test (Boots) (1952). 
13 ccupational Index Score. 
(is Home Index Score. 
) Year-end marks on teacher-made tests of attainment. 
d been administered 


the Cali : : ear, 1959, the same pupils ha 
b alifornia Achievement Battery, Junior High (1957), whose subtests, listed 
elow, were used to help identify factors in the factor analytic studies. 


15) California Reading Vocabulary. 


In May of the previous y 


ee California Reading Comprehension. 
lig) California Reading Total. 

ia California Arithmetic Reasoning. 

on California ‘Arithmetic Fundamentols. 
(2 a California Arithmetic Total. 

ba California Language. 

(22) California Spelling. 

(23) -California English Total. 


arative purposes were : 


1935)—raw scores obtained two years 


m Further data used for comp 
(24) Laycock Mental Ability Test ( 


earlier. 
administered in May, 1956. 


rogressive Matrices (1947) 


(25) Raven Coloured P. 
(26) CMM Primary (1953)—administered in May, 1950. 

The CTMM Elementary (1957) was also analysed by subtests. 
Bi CTMM Spatial. 
FA CTMM Logical. 
(29) CTMM Number. 
(30) CTMM Verbal. 

statement 


as obtained by assigning each pupil's e 
standard score from Blishen’s (1958) Canadian 
lata, and evenly weighted 
ccording to income and years of training required. The Home ees ope 
cousisted of twenty items, adapted for loc s from a scale devised © y 
Gough (1949). The purpos asures was not to isolate the crucia 
Variables operating to accelerate mental maturity, but rather to indicate when a 
umber of these stimulating correlates was presen tly or 
Ndirectly, and when their importance was reduced. 
tl On the basis of previous research, and the theoretical p 
( A following predictions were made : ; 
a) The nine culture-reduced tests*will show g loadings as high 


conventional verbal tests of intelligence, 


osition adopted, 


as those of the 
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(b) The nine culture-reduced tests will show negligible loadings on those group 
factors which appeaı to depend heavily on cultural acquisitions. — = 

(c) Socioeconomic status of father will show greater correlations with scores 
on the conventional verbal tests than it does with scores on the culture- 
reduced tests. , : 

(4) Over a period of four years (1956 to 1960) children from upper socio- 
economic levels will improve their status on the CTMM more than on the 
culture-reduced Progressive Matrices. , 

(e) An item analysis of the most promising culture-reduced tests will show that 
upper and lower thirds, by socioeconomic status, will succeed in similar 
ratios on each item. On the CTMM Verbal Test, discrimination indices 
will show greater variation from item to item. PA 

Additional evaluative criteria of relationship with school achievement, reliability 

and suitability for classroom use were also examined. 


V.—ANALYSIS OF RESULTS. 

Three socioeconomic variables were used in the anal 
T-score, home index raw score, and a combin 
equal weights, yielding what was termed a socioeconomic index. Preliminary 
analysis on the two component sub-scores showed an intercorrelation of 0-61 
which is close to Gough’s corresponding coeflicient of 0-65 (1949). The pattern 
of correlations with other variables was almost identical for the two measures 
(tho=0-87) suggesting that they sample highly similar correlates of mental 


ability. A split-half reliability coefficient of 0-77 was obtained from the Home 
Index questionnaire. 


(a) and (b) The two hypotheses concerning 
test were investigated by performing two fact 


yses—occupational 
ation of the two measured with 


and included many subtests. The second 
concentrated on the eleven total test scores (Tables 3 and 4) 


t atrix six principal components were extracted* but the 
last was discarded because it failed to reach significance level, according to 


the tests of McNemar, Burt, and Guilford and Lacey. The five significant 
factors were then rotated using criteria of positive manifold, maximization of the 
general factor and psychological meaningfulness. After four rotations a 
compelling psychological factor pattern emerged (Table 2). The factors were 
interpreted as general ability (g), a verbal educational factor (v : ed), socio- 

numerical ability (n) and, possibly, speed (sp). The fact 
r appeared in the analysis may be due to the low ceiling of 
the CTMM Spatial Subtest, or the strong speed element in the Holzinger- 
Crowder Test. The eleven total test scores were analysed similarly yielding 


ate that the tests most highly loaded 
; of g, the Progressive Matrices, an 
igure Analogies Subtest. The remaining six culture- 
reduced tests showed less promising g saturations, or significant loadings 02 
verbal or numerical factors $ 


with the general factor were Cattell’s Test 


* Communalities were estimated by calcul 
each variable with the 


ating the squared multiple correlation of 
s n-l remaining v; 
(1959) and Kaiser (1960) 


L1 ariables in the battery, as suggested by Wrigley 
A - Iteration was performed after replacing the communality with & 
value increased (or decreased) by one-; 


3 S and-a-half times the discrepancy between obtaine 
and estimated values, until convergence occurred. 
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TABLE 2 


ROTATED Factor SOLUTION AND COMMUNALITIES FOR TWENTY-ONE VARIABLES. 


(Loadings below -20 omitted) 


Test Name 2 1 
No 
1 Prog. Matrices . 708 542 
2 Cattell g eres 754 — — — 222 626 
3 ig. Class .. 582 355 
5 . Series 547 — — 395 — 486 
6 L-Th. Fig. Anal. ... 738 553 
2 Holz.-Crowd. Fig. Ch. 498 — — 211 369 440 
8 Holz.-Crowd. Series . 457 214 — 453 303 554 
11 Holz.-Crowd Spatial . 397 — = — 396 326 
12 Occ. Status .. . 251 — 683 — — 583 
13 Home Index .. 247 205 669 — — 602 
15 Reading Vocab. ... 335 743 — — 264 747 
16 Reading Comp. .. 501 618 — — — 681 
18 Arith. Reasoning 460 341 —- — — 369 
19 Arith. Fund. 450 436 — 442 — 629 
21 Language 417 589 — — 299 641 
22 Spelling 201 618 — — — 443 
24 Laycock ... 676 509 — = — 735 
27 CTMM Spatial 612 — — =x — 428 
28 CTMM Logical 660 — — — — 476 
29 CTMM. Number 639 201 — 331 — 563 
30 CTMM Verbal ........ 458 658 — — — 694 
% of Common Variance ...... 51-18 | 25-23 | 9-11 7-80 6:68 |100-00 
% of Total Variance ........,, 27 13-78 


TABLE 3 


CORRELATION MATRIX For ELEVEN ToraL Tesr Scores. 


(Decimal Points Omittted.) 


(515) 
443 (727) 
17 425 356 467 419 486 351 772 (800) 
20 5 ae 532 393 504 425 562 ‘G29 (574) 
= 3 2 408 363 466 5 619 33 598 (660 
24 507 498 Gat 1a = ; Ga 


428 451 431 732 713 550 642 (741) 


* Final estimate of the communality, 


= 
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TABLE 4 
Roratep Factor SOLUTION AND COMMUNALITIES FOR ELEVEN TOTAL TEST SCORES. 
(Loadings below -20 omitted). : 


Test Name 


Prog. Matrices 
Cattell g 
Lorge-Thorndi 

Holz.-Crowd. Fi 
Holz.-Crowd. 
CTMM Non-Lang. 

CTMM Language .. 
Calif. Reading 
Calif. Arith, 
Calif. Eng. . 
Laycock 


%6 of Common Variance 


% of Total Variance 


(e) To investigate the third hypothesis, the combined index a pea 
economic status was correlated with each test, and the difference be ween 
coefficients tested for significance. Table 5 shows, the correlations of pod 
economic status with the nine culture-reduced tests in the left column and, Sy 
verbal tests across the top. Significance levels of difference between coefficients 
are found in the body of the table. 


Only three of the observed differences : 
level, a finding which supports the hypothesis that the culturei die ee 
selected show less differentiation between socioeconomic Teve Mian Co c 
conventional verbal tests. The three tests found to have hipe E a 
showed satisfactory line: ion and so can be considered to 


rity of regressi 
third criterion of culture 


failed to reach the -05 significant 


-reduction quite adequately. 


TABLE 5 


ERENCES IN CORRELATION Cor 
SELECTED TESTS. 


ICIENTS BETWEEN Socio- 


SicNiricance LEVELS OF DIFF 
Economic STATUS AND 


y CTMM 
CTMM Laycock CTMM Verbal 


Ba Total Total Lang. 


with á 
SES. 384 : a3 
P -01 ; “Ol 
“Togressive Matrices. ..- "22 ‘01 E “01 
Cattell Test of g s 01 i -O1 
Lorge-Th. Fig. Class ‘01 $ -01 
Lorge-Th, No. Series -- shes -01 “02 -01 
Lorge-Th. Fig. Anal. .- ‘01 
Lorge-Th. Total koy 
CTMM Non-Lan 
Holz.-Crowd. Series s3 
Holz.-Crowd. Fig. Chi .. 


Tost 


-01 
-05 


g. 
“01 


01 
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(d4) The Progressive Matrices and CTMM tests were used to test the fourth 
hypothesis of change in relationship with socioeconomic status over four yo ie 
Since appropriate forms of the Matrices and CTMM had been administered to the 
sample in both Grades III and VII, it was possible to predict an increase in 
correlation with socioeconomic status for the more culture-loaded but not 
the culture-reduced test. The fact that the correlations between grade levels 
were similar for each test (-545 and -471, respectively) made the assumption of 
similar degree of overlap in the functions measured on both occasions appear 
reasonable, while the same coefficients were considered low enough to allow 
those socioeconomic variables associated with test scores to operate. 

Correlation coefficients with socioeconomic status were calculated for 
normalized T-scores on all four tests, and differences tested for significance.* 
Results showed an insignificant increase of 0-015 in the case of the Matrices, 
whereas the corresponding CTMM Total correlation increased from 0-215 in 
Grade III to 0-323, a difference which is significant at the -05 level. _As 
predicted, the abilities sampled by the Matrices test appear less susceptible 
to the cumulative effects of those socioeconomic variables which influence the 
mental development of children between Grades ITT and VII. 

(e) To investigate the fifth hypothesis, that a culture-reduced test would 
show less variation than a culture-loaded test in terms of item discrimination 
indices across socio-economic levels, the Progressive Matrices and CTMM 
Verbal Test were selected as representative of each category. The percentage 
passing each item in high and low socio-economic thirds was calculated for each 
test, distributions of discrimination indices drawn up for each test, and their 
variances compared. Only items with difficulty indices between 0-10 and 0:90 
were included in the analysis. 

The variances of the two distributions for the Matrices and CTMM Verbal 
Tests were calculated as 16-32 and 35-28, respectively, which, when compared, 
yielded a significant F ratio of 2-16 with 28 and 35 d.f. As hypothesized, items 
in the verbal test show greater variation in their discrimination between some 
socioeconomic levels than those of the non-verbal test. Although these dis- 
crepancies may not mar the predictive validity of the test assuming fixed 
treatments, they suggest that the verbal test is less suitable as a measure of 
intellectual potential. The Matrices discrimination indices were found to be 
similar throughout a wide range of difficulty. 

The criterion of relationship with school marks and standardized tests 

was then investigated. End-of-year examination averages over all school 
subjects for each of the thirteen classes in the study were correlated with each 
of the intelligence tests and subtests. Results from the California Achievement 
tests administered in the previous year were also correlated with the major 
variables, using the whole sample of 271 pupils. The median correlations for 
the thirteen school marks and the mean correlations for the three California 
Achievement subtests are included in Table 6. Although less efficient than 
the culturally weighted conventional tests in this ‘ cold-blooded ’ predictive 
situation, the correlations of the better culture-reduced tests are substantial 
enough to indicate promise for adaptive treatments. 
. Typical reliability coefficients reported from previous research are included 
in Table 6. A check was made on the Progressive Matrices in view of its promise 
in terms of the other criteria. A split-half reliability co-efficient of 0-91 and a 
Kuder-Richardson coefficient of 0-83 indicated a degree of reliability very close 
to the average value reported in other studies at this age level. (Burke, 1958). 


* The sample was reduced fro 


m 271 to 243 to test this hypothesis due to incomplete 
test data from Grade III, 


= 
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All tests were consi i 

i IL tes sidered convenient for classroom use, with the i 

= 1 p ossible 

ste be the Holzinger-Crowder subtests, whose administration O 

The Maras to be tedious and repetitive, and time limits somewhat unrealistic. 

pean atrices test was judged the most popular by pupils and its ease of admin- 
ation could certainly be regarded as an asset. 


VI.—DiscussION AND CONCLUSIONS. 


ave been drawn together in Table 6. In 
at, assuming adaptive rather 
ng and low correlation with 
ht than the present relation- 


E. he main findings of the study h 

ad a this table it should be remembered th 

St hana treatment, the criteria of high g loadi 

shi conomic status should be accorded more weig 
1p with school achievement. 


Table 6 indicates that the Progressive Matrices best meets the criteria 
test. In addition to 


A Ge por evaluation of a culture-reduced intelligence tesi i 
Ban gh g loading and low correlation with socio-economic status, this test 
oem ee good predictor of school marks, has a respectable reliability, 
giv S21 change in relationship with socioeconomic status over four years, and 
ioe little indication of variation 1n item discrimination across socioeconomic 
di T, s. Moreover, the test has face validity in that the progressive increase in 

ificulty makes it a tester of what it teaches and it appears to be little affected 


y the previous experience of the testee. 
s The shorter Cattell Test of g and Lorge-Thorndike Figure Analogies 
ubtest rank next. Reorganization of instructions, lengthening, and item 
eliability 


analyses might well improve these tests, particularly in relation to reliabil 
vhere they are weakest. The remaining 


a correlation with schoool marks, Y } reake 
Ea ture-reduced tests showed less promise in this investigation although further 
udy is currently being conducted on their potential values in relation to racial 


and language variables. 


TABLE 6 
SUMMARY or SELECTED INTELLIGENCE Tests EVALUATED AGainst SIX CRITERIA. 


Median r | Mean r Split-Hall] 
Sch.Marks} Acht. Relia- 
bility 


x 8. 
Test loading Tests 


Prog. Matrices 78 ae ‘35¢ 
Cattell g. sas seene -79 : k fa A 
-58 P 5 5 84 
porge-Th. Series .- 55 S x k a = 
orge-Th. Fig. Anal] *74 ; 43 47 .93 
Orge-Th. Total -75 4 ; . -49 a 
Hoiz Crowd. Series -46 K 2 “39 mes 
celz-Crowd. Fig.Ch} 52 2 7 re 38 91 
CAM Non-Lang. z K ; F 66 +95 
ee Lang. sess -58 5 65 95 
La M Total . 64 -95 
Yyeock Total ..| 68 ; À 


fiin, 


* 
Re-test reliability coefficient. 
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For further research, too, one might predict considerable potential for an 
unspeeded test which used the Matrices approach of testing what it taught, and 
which supplemented the abilities measured with the most discriminating figure 
analogy and perceptual series items, or variations thereof. 


In summary, it seems reasonable to conclude that as tests of intellectual 
potential, the conventional verbal intelligence tests examined in this study show 
discrimination against children from lower socioeconomic backgrounds. It has 
been shown that, for 12 and 13-year-old pupils : 
(1) Culture reduced tests sample the general intellectual 

or better than conventional tests, 

(2) Most culture-reduced tests show 
factors. 

(3) Culture-reduced tests show significantly 
economic status than do conventional tests. 

(4) A conventional verbal test (CTMM) showed significant increase in relation- 
ship with socio-economic status over four years, whereas the Progressive 

Matrices showed no change. - 

(5) Verbal items from the CTMM showed greater variation in item discrimina- 
tion between social classes than did items from the Progressive Matrices. 


Generalizing further, the three most promising tests identified by the study 
could profitably be put to greater use in the estimation of intellectual ability 
for adaptive teaching purposes, and particularly in talent identification and 
classification of pupils from underprivileged and foreign language bekgrounds, 
for distinguishing between mental defect and reading difficulties due to other 
causes, for holding intelligence constant in research experiments where verbal 
ability is irrelevant, and generally, as a supplement to verbal intelligence 
measures in the routine testing of children from all socioeconomic levels. 


ability factor as well as 
negligible loadings on verbal and numerical 


less relationship with socio- 
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THE RELATION OF PSYCHOLOGICAL TEST SCORES TO 
ACADEMIC BIAS 


By LIAM HUDSON 
(The Psychological Laboratory, University of C. ambridge) 


Summary. On entering the Sixth Form, English schoolboys choose between 
Arts and Science subjects. Effectively, this choice commits them to one career 
field or the other. The present research is aimed to discover whether psycho- 
logical tests reflect the academic bias of such boys towards Arts or Science 
subjects ; and whether they give any indication of boys’ aptitude for specialised 
work at university and in research. A battery of conventional psychological 
tests was given to a sample of 315 15 and 16-year-old schoolboys, drawn from 


five schools. These boys had just finished the G.C.E. ‘O’ level examination, 
and stood a good chance of reaching university. 


Results show that the tests used do reflect academic bias surprisingly 
well. On the other hand, there are signs that the relation of tests scores to 
aptitude for university and research work will prove to be complex. Some of 
the most gifted young Science specialists, for example, have scores which bear 


no relation to the trend of results in the sample as a whole. The implications 
of these and other findings are discussed. 


I.—Inrropuction,. 

On entering the Sixth Form, at the age of 15 or 16, the clever English 
schoolboy has to choose between Arts and Science subjects. The choice he 
makes determines to a considerable extent the eventual course of his career. 
If he becomes, say, a History specialist, he will find it difficult later on to change 
to Classics or French ; and even more difficult, if not impossible, to change to 
Mathematics or Biology. Obviously, the more relevant information he has 
before him when making this decision, the better. 


It is this choice which forms the context for the present research. Its 


ultimate aim is a series of tests which would show a 16-year-old (a) roughly 
what his own potentialities are, and (b) what demands would be made on him by 
various careers at university and in research. In order to develop and validate 
such tests, a series of longitudinal experiments are needed, in which individuals 
test scores as schoolboys are related to their achievements as adults. The 
present paper is merely an interim report on the first stage of one such project. 
It concerns the basic question: do test scores reflect a schoolboy’s bias of 
ability towards Arts or Science at the age when the crucial choice is being made ? 
Although this relation of test scores to academic bias will be discussed in detail, 
no attempt will be made to show that the relation is stable through time, OT 
more generally that the tests have predictive validity. The criteria used are 
those appropriate to the Lower Sixth Form, and these are not grounds for 
vocational advice. 

Relatively little is known about the relation of test scores to academic 
bias. During the last twenty years, many attempts have been made to validate 
differential aptitude batteries, but these have met with little success (Super, 
1957 ; Horst, 1959; Anastasi, 1961). A notable exception is Crawford and 
Burnham’s (1946) report that the Yale Differential Aptitude Battery has 
differential validity in predicting college grades. They also report (p. 242) that 
certain undergraduates have marked and persistent verbal/spatial biases © 
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test re. Simi ; 

Baten tie inpar biases have been reported by Bradford (1948) among 

eee a bear, The oe in both cases imply that such biases of 
Sp ela o biases of academic abilit i ; 

relations have not been substantiated (Burt, 1950). y and nae 


(1) The Sanat IJ.—-METHOD. 
ie. ie present results are based on a sample of 315 15 and 16-year-old school- 
mec ae CSE were drawn from five schools : two public boarding, one public day, 
boys ie grammat All five have high academic reputations. In each case, the 
oO? iene tested as they entered the Sixth Form, having just taken the G.C.E. 
reheliing is examination. They were selected as having a reasonable chance of 
intellioe man actelye The general level of their ability was high, the mean 
Cami gence test score being only a few points lower than that found among 
bridge undergraduates (Hudson, 1960). 


(2) Subject-Groups and Criterion-Groups. 
In analysing the results, two systems of classification are used : Subject- 


Groups and Criterion-Groups. In the first of these, boys are grouped in terms 
History, Physical Science, 


x mheir special subject : C lassics, Modern Languages, i T 
corn iology. This method is straightforward and yields a number of interesting 
mii ar On the other hand, it gives little indication either of a boy’s 
anal ude for his special subject, or of his academic bias. The most useful 
of ysis is one which relates test scores to criteria both of academic success and 

academic bias. In the present case, this is done by using Criterion-Groups. 
ess of each boy were 


“ise Both the academic bias and the academic succ 
Ssessed from questionnaires, staff reports and G.C.E. “O” level results. Four 


‘iterion-Groups were then selected, with twenty-five boys in each: ‘Arts 
Pecialists ’ consisting of boys w. ful in Arts subjects and 


po ts | ho are highly success 
R jan at Science ; ‘ Arts All-rounders ” who are highly successful at Arts and 
a equate at Science ; “ Science-All-rounders ’ who are highly successful at 
eel and adequate at Arts; and ‘Science Specialists ’ who are highly 
Successful at Science and poor at Arts. As they were restricted to boys who are 
aoe successful in their own subject, these Criterion-Groups reflect both 
` ifferences in academic bias and differences in academic level. Differences 1m 
academic bies are reflected in comparisons between the Criterion-Groups ; 
ifferences in academic success are reflected in comparisons between the 

riterion-Groups and the remaining 215 boys in the sample. Four Criterion- 

eter rather than five were used because genuine all-rounders proved 
clatively hard to find. The number of boys 1m each of the Subject- and 
t to note that the two 


suiterion-Groups is shown in Table 1. (It is importan the 
peteing of classification are independent, in the sense that a Classicist may be 
und in either of the Arts Criterion-Groups or in Rest-of-Arts, but not, of 


Cours, A > 
urse, among the Scientists.) 


(3) The Indices Used. 
used was A.H.5, a group test of high-grade 


atclligence specially suited to $ dent samples (Heim, 1956). The test is 
dministered in two Parts: Part I being verbal and numerical, Part II being 
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Accuracy. A number of accuracy scores were derived from AHS, the 
most useful being ‘ Worse Accuracy ’ and‘ Range Accuracy.’ A boy’s accuracy 
on Parts I and II of A.H.5 were calculated separately—the number of correct 
answers as a percentage of the number attempted. The lower of the two 
percentages was then defined as his Worse Accuracy, and the difference between 
the two as his Range Accuracy. The accuracy score given in Tables 2-5 is 
Worse Accuracy. 

Spatial Ability. An experimental spatial test was uscd as none of those 
already published was sufficiently dificult (Heim and Watts, 1957). Jt was 
aimed to tax students’ ability at manipulating geometrical shapes in their 
mind's eye, but in practice was found to correlate substantially with General 
Intelligence (r=approximately 5}. 

Vocabulary. An experimental, multiple-choice form was used, published 
tests being too easy (Heim and Watts, 1961 De 

General Knowledge. 
factual knowledg 
music, science. 

Cognitive Bias. A differential score 
ability in the cognitive tests. This contrasted his Verbal ability on the one 
hand with his Numerical and Spatial abilities on the other, (These scores were 
derived from A.H.5, A.H.5 and the Spatial Test, respectively.) 

Bias of Interests. Interests were assessed by a system of cross-checks. 
Five sources of information were used, although all five were not available m 
every case; questionnaire, form-ratings, general knowledge, school staffs, 
interviews. A differential score was used to measure each boy's bias towards 
“cultural and political’ as opposed to ‘ practical and outdoor’ interests. A 
working definition of this second category of interest was reached with some 
difficulty. Certain activities contribute heavily to an Arts/Science discrimina- 
tion: car and radio mechanics, model building, carpentry, photography, 
camping, rock climbing, sailing, bird-watching. While others, ostensibly 
similar, show a significant tendency to obscure it : cine-photography, model 
railways ; hunting, shooting, fishing, cycling, skiing, foreign travel, Similar 
difficulties were found in the ‘cultural’ sphere; interests in literature and 
painting contribute heavily to an Arts/Science discrimination, whereas an 
interest in music tends to obscure it. As the present aim is primarily dis- 
criminative, an operational definition of these categories was felt to be in order. 
So those interests already mentioned as obscuring the Arts/Science discrimina- 
tion were left out of account. 

With one exception, the seven indices mentioned 
independent of one another, the correlations be 
‘5. The exception concerns the scale of Cognitive Bias and the Spatial Test from 
which it is partially derived, r in this case being of the order of -7. Scores on the 
first five of these indices are graded on a five-point scale, A-I5, Grade A repre 
sents the top 10 per cent. of each school sample; B, the next 20 per cent. + 
C, the middle 40 per cent. ; D, the next 20 per cent. ; and E, the bottom 10 pe! 
cent. A fuller account of tests and methods is given in Hudson (1961). 


An experimental open-ended test was used, involving 
e about 100 men, famous in current affairs, literature, painting, 


expressed the bias of each boy's 


above are reasonably 
tween them ranging from zero tO 


, I1I.—Resvrts. 
(1) Subject-Groups. 


The first aim in presentin 
Subject-Group ; to show, 
ences between Classicists 


g the results is to give test ‘ profiles y for pa 
for example, whether there are characteristic diffe 9 
and Historians. In the interests of clarity, Table 


EO O 
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summaris se profi itati 

To ns pi proie qualitatively. The over-simplification which this 

patot Subject-Gro y unimportant, because the difference between any given 

whichiol the dia roups is expressed quantitatively in Table 3. Table 3 indicates 
ifferences shown in Table 2 are of real discriminative importance. 


Tabl P wi : 
and ecg deta that Classicists are typically of average ability or better, 
intéiaste ars a Bone particularly strong in accuracy and vocabulary. Their 
arë ERE a ag and they have no marked cognitive bias. Modern Linguists 
DEENA A a = apart from a verbal cognitive bias and, very surprisingly, a 
Linguists iad pia ulary—an inexplicably large number of highly successful 
bylo as 2 a ocabulary grades of D or E. Historians are clearly distinguished 
Physical s 3 e scores and a strong ‘cultural and political ’ bias of interests. 
of saenthi TS have high scores on cognitive tests, and a marked bias both 
Sironi era) ility and of interest. Biologists are undistinguished except for a 

g ‘ practical and outdoor ’ bias of interests. 


TABLE 1 


Tue S 
è SAMPLE TE r s 
LE TESTED, DIVIDED IN TERMS OF (i) SusjecT-GROUPS, AND (ii) CRITERION- 


GROUPS. 


SUBJECT-GROUPS CRITERION-GROUPS 


Specialists 
All-Rounders 
Rest-of-Arts 


Classics 
Modern Languages 
History 


Science Physical Science Specialists 
All-Rounders 


Biology 
Rest-of-Science 


TABLE 2 


A QUALITATIVE Summary oF TEST SCORES. 


GENERAL SpaTIAL VOCABU- GENERAL COGNITIVE 
INTELLIG- ACCURACY ABILITY LARY Kknow- Bras INTERESTS 
LA ENCE LEDGE 
SSt¢g 
tome Fair V. Good Good v.Good Fair None Diverse 
PRN Law. 
Hisy, “ANGUAGEs | Fair Fair Fair Weak Fair Verbal Diverse 
Story 
AT ee ees Poor Weak Poor Fair Fair Verbal Cultural 
Tys & Political 
TP 
Scirncr ..| V. Good Good V. Good Fair Fair Numerical Practical 
Borg & Spatial & Outdoor 
Gy 
| Enia si Fair Fair Fair Weak Fair None Practical 
= & Outdoor 
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TABLE 3 


INDICES WHICH DISCRIMINATE BETWEEN ANY GIVEN PAIR OF SUBJECT-GROUPS. 


MODERN History PHYSICAL BioLoGY 
LANGUAGES SCIENCE 


Vocabulary Spatial Interests Interests 
ICS Ty pi 
pe Accuracy Vocabulary 


MODERN LANGUAGES .. Intelligence Cognitive Bias = 


HISTORY INTERESTS INTERESTS 
Cognitive Bias 
Spatial 
Intelligence 


PHYSICAL SCIENCE .... 


STRONG DISCRIMINATION =0%—15%, errors. 
MODERATE DISCRIMINATION = 16%—25% errors. 
Weak Discrimination=26%—35% errors. 


In describing the discriminations between the various Subject-Groups, 
three standards of discriminative efficiency are established : Strong, Moderate 
and Weak. A discrimination between two groups is defined as Strong if the 
proportion of errors involved is 15 per cent. or less, as Moderate if the proportion 
of errors lies between 16 per cent. and 25 per cent., and as Weak if it lies between 
26 per cent. and 35 per cent. For example: if 15 per cent. of Historians lie 
above a given cut-off score on the vocabulary test while only 25 per cent. of 
Classicists lie below it, the proportion of errors involved in discriminating 
between these two groups is 20 per cent., and the discrimination is defined as 
Moderate. These three categories are distinguished typographically in both 
Tables 3 and 5; and in each cell of these Tables, the indices are arranged in 
descending order of discriminative efficiency. The cut-off score used in each 
case is the median for the whole schoolboy sample. All the discriminations 
shown in Tables 3 and 5 are statistically significant (P=0-05; X2 Test with 
Yates’s Correction). 


Although none of the discriminations between Subject-Groups is Strong, 
Table 3 shows that there are a number of significant differences between boys 
who specialise in the various academic subjects. The most obvious is that 
separating Historians from the Physical Scientists. For the rest, Classicists are 
distinguished from Modern Linguists and Biologists by their high vocabulary 
and from both groups of Scientists by the bias of their interests. Modern 


Linquists are distinguished from Physical Scientists by their cognitive bias. 


(2) Criterion-Groups. 


The Criterion-Groups consist of boys who are highly academically success- 
ful; and they differ only in their academic bias. So contrasts between 
Criterion-Groups should reveal whether academic bias is reflected in test scores ; 
and the contrast of the Criterion-Groups with the rest of the sample shoul 
reveal whether academic success is so reflected. The profiles of the a 
Criterion-Groups and the remainder of the sample are summarised qualitative! 
in Table 4. 
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I PA 
ae the Arts Specialists have poor cognitive scores. They are also 
Caa ae K Fom Scientists by their interests, and from all other Criterion- 
en, eet heir cognitive bias. The other three Criterion-Groups tend to have 
i ereet and the differences between them are relatively slight. These lie 
ous S, cognitive bias and, perhaps, spatial ability. The two ‘ Rest’ 
eer re oe similarly. It seems that they occupy middle ground between 
estie pecialists at one extreme and the remaining three Criterion-Groups at 
The efficiency of i in distinguishi i i 
Criteri cy of each index in distinguishing between any given pair of 
z autem is shown in Table 5. Again, only statistically significant 
aces are mentioned. The results may be summarised as follows : 


Seas? On the whole, discriminations between Criterion-Groups are much 

ger than those between Subject-Groups. 

ane Oy Among successful schoolboys, one distinction is of over-riding import- 

from the ee aa separates Arts Specialists (i.e., those who are poor at Science) 

ia (c) Relative to this gulf, the dichotomy 

Pac a tant. Only his interests reveal the direction in w! 
nder has specialised. 


ato (3) Academic bias is reflected e: 
telers Scientists. Or—to express t 
oe weakness in Science excellently, 
and (e) The relative similarity of Arts Specialists and Rest-of-Arts groups, 
don he dissimilarity of both from Arts-All-rounders emphasizes that tests 
1l ot measure the qualities underlying success in Arts, and also that the Arts 
-rounder is a relatively unusual type. 
S (f) There is a surprising, although non-significant, tendency for Arts 
Pecialists’ cognitive scores to De worse than those of the Rest-of-Arts group. 
disti (g) The indices : racy, and Spatial Ability all 
m Inguish boys who are good at Science from those who are poor. Neverthe- 
7 nat very high Accuracy scores are largely restricted to Arts AlLrounders, 
Vhile the Accuracy scores of successful Scientists tend to cluster in the B and c 
&rades. Vocabulary measures academic aptitude in Arts and Science alike ; 
A, to a lesser extent, the same seems to be true of General Knowledge. 
Ognitive Bias gives a good indication of academic bias, particularly among 
at boys; and Bias of Interests distinguishes Arts boys as a whole from 
Clentists. - 


between Arts and Science is less 
hich a successful all- 


xcellently among Arts boys, but poorly 
he same relation in reverse—the tests 
but only poorly reflect weakness in 


Tables 4 and 5 do not give a clear idea of how efficiently the various academic 
8roups could be distinguished in practice, if all seven indices were combined. 
Ca a rough check on the strength of these discriminations, the scores of all the 
id, Iterion-Group members were placed on cards and an attempt was made to 
entity each boy in the light of the evidence given In Tables 4 and 5. The 
atisfactory : 


Tes 
€sults, shown in Table 6 are most s 
n-Group members are identified correctly, 


(a) Two-thirds of the Criterio. 
ps only some 25 per cent. would have been indentified correctly had they 
in en sorted at random. As many aS 97 out of 100 are placed either correctly or 
4 neighbouring group. 
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TABLE 4 


CRITERION-GROUPS : QUALITATIVE SUMMARY OF TEST SCORES. 


GENERAL SPATIAL VOCABU- GENERAL CocnitivE Bras ong A 
INTELLIG- ACCURACY ABILITY LARY Kxow- Bias INTEREST! 
ike LEDGE a 
95 
i al A 
ARTS SPECIATISTS vits Poor Poor Poor Good Fair Verbal a paisa 
25 
ARTS ALL-ROUNDERS ..| V.Good V.Good Fair V.Good Good None E Political 
i 25 
SCIENCE ALL-ROUNDERS| V. Good Good Good V. Good V. Good None No Bias a 
25 
eE a EA |) VaGood Good V. Good Good Good Numerical Practical 
& Spatial & Outdoor 4 
82 
REST-OF-ÅRTS ........ Weak Weak Weak Fair Weak None No Bias f | 
d 13% 
REST-OF-SCIENCE ,... Fair Fair Fair Weak Pair None Hanel 
& Outdoor 


TABLE 5 


INDICES WHICH DISCRIMINATE BETWEEN ANY 


GIVEN PAIR OF CRITERION-GROUPS. 


ARTS SCIENCE SCIENCE Rest-or-Arts Rerst-OF- 
ALL-RounpErs| ALL-RounDERS SPECIALISTS ScrENCE 


ARTS ACCURACY SPATIAL SPATIAL 
SPECIALISTS ..| CoGNitIVE Bras| Accuracy COGNITIVE BIAS CoenitivE Bis 
INTELLIGENCE | INTELLIGENCE | INTERESTS Spatial 
Spatial Cognitive Bias | INTELLIGENCE Accuracy 
Interests Accuracy Vocabulary 


INTERESTS 


Arts 


INTERESTS INTERESTS INTELLIGENCE | INTERESTS 
ALL-ROUNDERS 


Y 
Accuracy Mocapuran 
Vocabulary Intelligent 
Cognitive Bias 


SCIENCE 


FE A Y 
Cognitive Bias | INTELLIGENCE | VocABULAR 
ALL-ROUNDERS 


GENERAL General 
KNOWLEDGE Knowledge 
Vocabulary Intelligence 
Accuracy 
Spatial 


igent? 

SCIENCE Sp. Intelligen® 
ATIAL 

SPECIALISTS .. 


Cocnitive Bras} Spatial 
INTELLIGENCE 
Interests 
Accuracy 


moa Bias 
Rest-or-Arts Cognitive 


Interests 


p N 


STRONG DISCRIMINATION = 0% —15% errors. 
MODERATE DISCRIMINATION = 16% —25% errors. 
Weak Discrimination =27%—35 % errors. 
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TABLE 6 


AN ÅTTEM EN 
MPT to IDENTIFY CRITERION-GRoup MEMBERS, Usinc ALL SEVEN INDICES 
TOGETHER. 


Identified as— 
“Arts AILR” Sc. All-R” “Sc. Spec.” 


“ Arts Spec.” 


n 


ARTS SPECIALISTS & 


Arts ALL-RoUNDERS 19 


SCIENCE 
ALL-ROUNDERS 


SCIENCE SPECIALISTS 


bety (b) There is some confusion between neighbouring groups, particularly 
> veen the two groups of Scientists ; but the discrimination between the other 
groups is good, and that between the two extremes, the Arts and Science 

vas identified as a 


Specialists, is almost complete. Not one Arts Specialist v 
clence Specialist, or vice versa. 

Stir (c) The discrimination between 

E ; only four Arts boys were identi 
ere identified as Arts boys. 


Arts and Sciences is also surprisingly 
fied as Scientists, and only six Scientists 


EE The samples used here are rather small, and a certain degree of chance error 
tl inevitable. When the research is extended to other schools, particularly 
hose with different academic traditions, changes must be expected. (It has 

een found, for instance, that the intellectual calibre of Biology specialists varies 
Pe eably from one school to another.) It seems unlikely, on the other hand, 
hat the major differentiations will be materially altered. 


(3) Other Findings. 
(i) Gifted Scientists. The general level of test scores among successful 
tice, most gifted young scient- 


Physi s Re dae $ 

ja ysical Science specialists is high ; and, in prac 

Sts are easily identifiable from their scores. There are, nevertheless, a few 
1 Science specialists whose scores are unaccountably 


iXceptionally gifted Physica c S i 
(OW. Perhaps three cases are sufficient to illustrate this type: D., a mathematical 
Prodigy ’; P., a Chemistry specialist who has since gained an Open Scholarship 

3 g scientists at the 


at Cambridge ; a kabl up of youn 
s ge; and R., the best of a remarkable group ot y' t 
Public day school. Their test scores are respectively : General Tatelligenre = 
Ge C; Accuracy—E, C, D ; Spatial Ability—A, D, C; Vocabulary—B, B;C; 
eneral Knowledge—C, B, A. All three have strong or fairly strong interests, 
oli tical and outdoor ’ bias. 


ut none has either a ‘ political and cultural ’ or ‘ prac 
All three are poe by their form-mates to be hard-working. In general, 


heir remarkable talents reflect themselves in their test scores in hardly a 
Single respect. 

Although such cases acco ; the whole sample, 

i obvious is that the boys concerned were 

ost of them 


m, this can 
hg preceded 


he and as their a 
m, they were watched with special care 
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ibility is that these boys were made over-anxious by the novelty of the test- 
foe Again, his seems feasible in a few cases but not in all. There are 
marked differences of personality and emotional maturity within this group, 
so it is unlikely that a single emotional factor could account for all of them. 
A third and much more likely explanation is that there are important scientific 
abilities which the present tests miss. There is, after all, no prima facie reason 
for believing that the complexity of scientific work can be epitomised in the 
present handful of paper-and-pencil puzzles. Indeed, it would be strange if it 
could. There are several normal scientific skills which conventional tests take 
little account of ; the ‘inductive ’ skills of marshalling complex evidence, for 
example, and accurate generalisation from incomplete evidence. Also there 1s 
the whole area of creativity (e.g., Getzels and Jackson, 1962). If new testing 


methods are needed, the best way of discovering them is presumably a detailed 
study of present anomalies. 


(ii) Extreme Arts Specialists. Another tendency obscured in the general 
presentation of results concerns a small group of Arts Specialists. These are 


boys who are outstanding at Arts subjects but very weak at subjects which tax 
their powers of formal reasoning, e.g., M 


athematics, Physical Science, Latin. 
This group shows a strong negative relation between such measures as General 


Intelligence, Accuracy, Spatial Ability on the one hand, and General Knowledge 
on the other. The scores of three of these extreme Arts Specialists—C., G., S.— 
are: General Intelligence—E, E, E ; Accuracy—E, E, E ; Spatial Ability—E, 
E Bg Vocabulary—B, C, B ; General Knowledge—A, A, A. All three have a 


very strong ‘ political and cultural ’ bias of interests, and a heavily verbal bias 
of cognitive ability. Both C. and G. are exceptionally hard-working. The 
General Knowledge scores of th 


ese three boys are, respectively, the fourth 
highest, third highest and the highest in the whole sample ; while, with one 


exception, G. has the lowest General Intelligence in the whole sample, and C. 
the lowest but one. As with the Physical Scientists mentioned above, this type 
is exceptional ; such boys account for less than 5 per cent. of the whole sample. 
Yet, they are possibly of even greater practical importance, because it is 
precisely this type of boy who is filtered out of the academic system by 
university requirements in Latin and Mathematics. 

(iii) Negative and Compensatory Relations. This negative relation among 
extreme Arts Specialists seems to be one instance of a more general pattern. 
Among Historians and Modern Linguists there is a ma 


rked tendency for a high 
score on any one test to be counter-balanced by Jow scores on others. In cases 
where a boy’s scores were all low, he was found almost invariably to be excep- 
tionally hard-working. So it seems that in History and Modern Languages 
certain mental abilities are mutually inter-changeable ; and that where ability 
is lacking, hard work will serve in its stead. It will be interesting to see which Q: 
the various test profiles found among Arts boys bode well academically ; 
and whether qualitative differences of intellectual type are detectable between 
them. Why this form of equivalence should not hold among Classicists, Physica 
Scientists or Biologists is not clear. One possibility is that History and Modern 
Languages admit a wide range of differences in intellectual ‘ style,’ whereas 19 


Classics and Science there may be a greater tendency for one intellectual method 
to be the best. à 


These results suggest that a compensatory mechanism may be at work 1” 
some of the boys concerned ; that special weakness in, say, formal reasoning 
has caused them to concentrate on general knowledge. Compensation of E 
kind is illustrated in a case of sibling rivalry. These two brothers were 18 month: 
apart in age, yet the younger had overtaken the older academically. They he 
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an unsati a 
crag ates naa home background ; and at the time of his interview, the 
cpa ner showed obvious signs of emotional disturbance. The younger 
spatial on a Physical Scientist with cognitive scores above the average, a 
i raor abilities, poor general knowledge and exclusively practica) 
bite oF bits he elder was a Historian with very low cognitive scores, a verbal 
fies is bility, good general knowledge, and exclusively cultural interests. In 
, the elder boy had concentrated his efforts in precisely those areas where 


his brother was weak. 


(iv) Accuracy, Emotionol Stability and Originality. Previous work has 
ed as an index of emotional stability 


fa both Worse and Range Accuracy were 
nd to correlate substantially with interview ratings of emotional stability 
n=5!. Individual cases also 


of some previous sugges 


oo of originality. Guildford (1950), for example, has suggested that creativ- 
ity may stem from a general distaste for repetition ; and Getzels and Jackson 


(1962) found ‘ creative ’ students imp utine of classroom work. 


IV.—CONCLUSION. 


bis With certain important reservations, we may conclude that the academic 
Dias of clever schoolboys does reflect itself in test scores with surprising clarity. 
versity and in research, the present tests could 


If this relation holds true at uni 
I of some use in advising schoolboys about their choice of career. On the other 
land, there are good reasons for expec weaken. The demands 
made by a particular subject, Physics for example, may well change radically 
between G.C.E. ‘ O ’ level and Tripos ; wo i > differ from research ; 
and the various branches of research in a 
themselves (Roe, 1953). So although conventional tests may a 
proved their worth at the Sixth Form level, the influence of ill-defined qualities 
like originality and maturity of judgment may well increase as the present 
ae matures ; and if they increase, conventional tests may tend to lose theit 
alue. 
emains to be 


Even at the Sixth form level, much ri be don 
ts, nor do they distinguish 


done. The present tests 
certain exception- 


do not reflect ability in Arts subjec ept 
ally gifted young scientists. New tests are certainly needed, but there Js little 
Indication of what shape they should take. The abilities which the ee 
tests fail to detect may be primarily cognitive, €-8:) the ability to marsh; 
complex evidence. T. hey may be more personal, €.8-» imaginativeness, maturity 
Of judgment, insight into human motives. Or they may involve differences © 
intellectual attitude and ‘ style’ (Rapaport 1957 ; Getzels and Jackson 1962). 
ys with relatively poor test 


co The fact ins that there are brilliant bo i 
scores, and cannas dull boys whose scores are relatively high. As Getzels 
T Jtional psychological tests normally account for 
ms that the 


oe Jackson point out, convention c n 
ded than a quarter of the variance in academic success ; and it see $ 
eductive, puzzle-solving abilities which these tests do measure have little 
faring on intellectual originality. So although there is scope for new cognitive 
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dox kind, the most fruitful research may well lie in those areas 
Deets ae style’ where cognitive and personal qualities interact. ey 
the past there has been a tendency to approach these areas obliquely, using 
projective techniques like the Rorschach (Rapaport 1957.) ; but SscnaHons 
of type emerging from this work have not yet been related to differences o 
attitude and ‘style ° discernible in academic practice. Obviously, if ties 
distinctions are to stand any chance of explaining intrinsically interesting 
phenomena, this gulf between the laboratory and the world 


vital. If test scores are to be 
is what validation should cons 
intellectual workers at work, to analyse 
they do it. The immediate task, in other 


y not only shows 

s thinking, but „also 
problems of description which these qualitative differ- 
ences present. Perhaps it is a good omen for the future of mental testing that 
the statistics these authors use are largely rudimentary. 


Finally, the limitations of the present results must be reasserted. In the 
first place, the intellectual demands made by the various Sixth Form subjects 
are relatively elementary ; in the second place, the choice between them by no 
means determines the course of a schoolboy’s career. A Sixth Form History 
specialist may end up as an accountant, businessman, politician, anthropologist, 
soldier, barrister, clergyman. He may have talents unrelated to his job—he 
may work in a bank and write poetry, or in a customs house and paint, Or he 
may have talents which circumstances force him to neglect. The present 
categories are bound to be provisional, and no really satisfactory assessment 
can be made until the present sample has been followed through university and 
beyond. Their test scores as Schoolboys can then be related not merely to 


formal criteria like degree class, but to full descriptions of their intellectual lives 
as adults. 
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CHANGES IN TERMAN-MERRILL I.Qs. OF EDUCATION- 
ALLY SUB-NORMAL BOYS 


BY 
C. S. RUSHTON 
(Sometime Educational Psychologist, Kent School Psychological Service) 
AND 
A. E. STOCKWIN 
(Sometime Deputy Headmaster, Seabrook Lodge School, Hythe) 


Summary. Test results on the Terman-Merrill* 1937 revision of the Sta nford- 
Binet Intelligence Scale (Form L) were examined for 111 ‘ educationally 
subnormal ’ boys in a residential special school. A comparison between results 
on entry at the age of 7 years and above, and before leaving between 14 and 16 
years, showed a deterioration in mean I.Q. scores, despite the fact that the 
Roberts-Mellonej (1952) corrections were applied. 


The proportion of ‘ verbal subtests ’ passed was consistently lower than 
the proportion of ‘ non-verbal ’ tests,’ at all age levels. 

The hypothesis that the deterioration in mean 1.Q. scores between mean 
C.A. 9} and 15} years, was due partially to the increased verbal weighting on 
this test at the higher age levels, was examined and supported. 

It was suggested that the test under consideration is unsuitable for the 
all-round cognitive assessment of educationally sub-normalt children. Suggest- 


ions were made for a more suitable type of test, and for further research on 
Terman-Merrill (L). 


I.—INTRODUCTION AND PREVIOUS RESEARCH. 


Previous investigations on this subject have yielded conflicting results. Many 
of the earlier findings were consistent in that the mean I.Q. of dull children 
appeared to decline with age. Thus, Phillips (1940) concluded that for children 
with Terman-Merrill I.Qs. between 50 and 80, there was a tendency for a general 
and substantial decline which increased progressively with age. Parker (1934), 
Thorndike (1940) and Stott, (1960) all show increasing deterioration of I.Q. 
scores with increasing interval between test and retest. 


Perhaps the first and most striking contradiction of these findings came 
from Wellman (1934), who reported that all the groups which she tested tended 
to show improved I.Qs. while ‘ the most backward group’ showed the greatest 
increase. Schmidt (1946) and the Clarkes (1958 and 1959) also reported striking 
gains in the I.Qs. of the ‘ feeble-minded,’ but the former study (Schmidt's) 
was severely criticized by Kirk (1948) and cannot now be taken seriously. 


More recent studies have had available more accurate measuring techniques- 
Collman and Newlyn (1958) for instance, on a test-retest study of education- 
ally subnormal pupils in special schools in Essex, found that the use of the RM 
correction table (1952) was fully justified. On re-test they found no significan 
mean change in I.Q.—but this was after a period of only one year. Hiskey an¢ 
Sadnavitch (1958) and Scarr (1953) report similar findings. 

* From this 
į From this 
$ From this 


point onwards referred to as T-M (L). 
point onwards referred to as R.M. corrections. 
point onwards referred to as E.S.N. children. 
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In a private communication, P. F. Portwood states that when I 
E r on, P. F. .Q. scores 
= ae ee were considered without the R.M. corrections being one there 
ope to be a significant drop in 1.Q. scores after a two-year interval. When 
SDs sh dou were applied, however, the significant drop (due to the increasing 
a nye Ns these age levels and thus to a spurious statistical effect) was 
ovid ee to a slight though statistically insignificant increase. Here again is 
oe nce which appears to confirm empirically the validity of the R.M. correc- 
ns, and the advisability of using them when dealing with T-M (L) 1.Qs. on 


E.S.N. pupils. 


II.—PRESENT INVESTIGATION—METHOD. 
I.Qs. of 111 E.S.N. boys ina 
usually at the age of 8 or 
16 years). All testing 


cal officers or by the deputy headmaster, who 


x For testings below C.A. 15 years, R.M. corr 
bove the age of 14 years 11 months no R.M. corrections are published, yet for 
the 15-year-old age group the mean I.Q. is 102-8 and S.D. 19 (Terman and 
Merrill 1937), and thus needs correction. MacNemar, in his earlier correction 
table (1942) used the same corrections for C.As. 14y. 6m. to 15y. 5m., as Roberts 
Mellone do for C.As. 14y. 3m. to Thus, it was considered 
justifiable to use these corrections the present investigation 
between C.A. 15y. Om. and 15y. 5m. 4 
correction, and at the 16-year age-level S.D. drops to 16:5 (Terma 
1937). Thus, for all test results above C.A. 15y. 5m., no correction was made. 
Means and S.Ds. were computed on these I.Qs., and significances of differ- 
ences between means were obtained, using Sandler's ‘A Test ’ (Sandler 1955), 
which he claims has been ‘ rigorously derived’ from Students ‘+’ Test, and 


corresponds to the two-tail form of ‘ t. 


III.—RESULTS AND THEIR TREATMENT. 

A. Subjects. Of the 111 boys studied : 

_ (1) 69 per cent. came from homes in ‘ urban areas, and 31 per cent. from 
rural ’ areas. 

‘ (2) 71 per cent. came from families whare io fa 
semi-skilled’ (Grades IV and V of the Registrar- ; 
occupation ea as contrasted with 264 per cent. in the general populatia 
of Kent (H.M.S.O. 1954). This finding is in line with the results of earlier 
Tesearches. (Jones 1954). er an 
(3) Only 29 per cent. came from families in the Mg er _occupa 
groups Rer trar General Grades I, II and III), as contrasted with 73 4 per 

cent, in the general Kent population (H.MS.O. 19 4). 

(4) 62 per cent. came from families wh 

{figs contrasted with the normal 21 per 
-M.S.O. ther hand, only 1 

Were ‘ I 4 Reser to 42 per cent. in the general Kent population 

(H.M.S.O; 1954). Eysenck (1953) also reports that 

Tom large families. 
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esults. 
K i” hes the population of E.S.N. boys tested, there is a stati 
nificant ‘ deterioration’ in mean T-M(L) L.Qs. 
(before entry to residential special sc 
special school). (See Table 1). 


stically sig- 
between the ages of 9y. 2m. 
hool) and 15y. 6m. (just before leaving 


TABLE 1 


Test-RetEest Data T-M (L) L.Qs. with R.M. Corrections APPLIED, 


First Testing 


Last Testing 


71-0 
7-91 
47 to 96 


9y. 2m. 
1-72y. 
6y. 11m. to 13y. 4m. 


y. 
l4y. Om. to 16y. Im. 


Difference betwe 


en I.Qs. on first and 
last testing 


5-0 L.Q. points 
0308 


Sandler's A 


Below -001 


+0-626 


TABLE 2 


ANALYSIS OF I.Q. CHANGES on T-M ( 


L) with R-M CORRECTIONS APPLIED. 


LQ. LQ. LQ. 
Deteriorated Constant Improved 


more than +3 points more than 
3 points 


3 points 
N es N % N yA 


61%] 27 244%! 16 144% 
N 


Level of 
Significance 


Total N=111 -05 


= } 


Below 60 
81+ 


| iii. 
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2 

the Fe De torteto coefficient (r=-+0-626) is in general agreement with 
Eysenck (1953) one would expect after a test-retest interval of about 6} years. 
test-retest poe ae: Stott (1960) report that the evidence from thirty similar 
about -04 a a indicates a correlation of -79 after 4 years, with a drop of 
these reda s in the coefficient for every additional year intervening. On 
be -40-69 bit for a test-retest interval of 6} years, one would expect ‘r’ to 
the fact that nereas we find +0:626. This lower correlation may well be due to 

many different school medical officers were involved in the testing. 


3) F 8 

tion ( Loe Table 2, it is clear that many more individuals show ‘ deteriora- 

constant? w. improvement ’ in 1.Q. on final testing. Chi? test, where ‘I.Qs. 
vere apportioned equally to ‘improved ’ and ‘ deteriorated,’ showed 


that thi 

a greater number of deteriorating scores was significant at a -05 level. 
thar RA one sub-divides the data, however, into I.Q. ranges, it can be seen 
Those sil ga to be a ‘ cut-off point’ around 1.Q. 60 on first testing. 
: ys with an initial I.Q. of below 60 on first testing, tend to show an 


Improv 
of ee (more than 3 points) on final testing. Those boys with an 1.Q. 
han 60 on first testing tend to show a deterioration of I.Q. on final 


testing of more than 3 I.Q. points. 

dmi e differences in the direction of 1.Q. change appear to be statistically 
they st , and greater than one would expect by chance alone. It may be that 
mean” partially explicable by the concept of ‘statistical regression towards the 
in diff (McNemar 1940 and Clarke, Clarke and Reiman 1958), since they occur 
Tief ee directions in the two extreme groups, and towards the group means. 
cantiot + that the group mean itself changes between first and last testing 
directis however, be explained on these grounds, as it changes in the ‘ wrong £ 

ion, i.e., away from the mean score of the population. 


JV.—EXPLANATORY HYPOTHESIS. 


A.—Procedure and Findings. 

foul a private communication, Professor A. D. B. Clarke, suggested that the 

vei ikely reason for the decline in scores was that “ the test material is more 
y a biassed on retest (Mean C.A. 15-54), than on initial test ” (Mean GA: 

Wei ie His argument is based on the hypothesis that “ the increased verbal 

sin, ghting at the higher age sible for decreasing the final 1.Q., 

ce commonly verbal asse ously impaired in E.S.N. children 


han are performance assets.” ( Jey and Gulliford 1960). 
he sub-tests of T-M (L) were classed as 


citna er to test this hypothesis, the tests of 2 ) we 

asic, Verbal’ (V) or ‘ Non-Verbal’ (including Arithmetical ’) (P) on the 

nee of the classification of test responses Ph pendix I in Kennedy- 

Pee aol aaa The original test sheet ch were immediately 

final ¢ le were then re-examined, and e l 

ences esting was labelled «V test’ or ‘ P test. Totals and 
are shown in Table 3. 


Bas 

Indications and discussion. an 
Suc (1) On Verbal Tests there is no statistically significant difference between 
Success rates on first and final testings. The direction of the change, however, 


is | 
towards an improvement in “ verbal efficiency- 
(2) On Non-Verbal Tests there is no statistically significant difference 


bet 
Pheer success rates on first and final testings. The direction of the change, 
ver, is towards a deterioration in « non-verbal efficiency.’ 
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TABLE 3 


DIFFERENCES BETWEEN VERBAL AND NON-VERBAL TESTS on T-M (L) wit R-M CORRE C- 
TIONS APPLIED. 


(N Boys=51) 


VERBAL TESTS 


Passed Attempted Percentage 
Passed 


First Testing ..., 352 927 


38% 


Last Testing .... 689 1580 43-6% 


Non-VERBAL TESTS 


Passed Attempted Percentage 
Passed 


First Testing ..., 349 519 67:3% 


Last Testing 


328 58-5% 


FirsT TESTING 


Attempted Percentage 
Passed 


927 38% 58-2 
P=less than 
Non-Verbal Tests £ 519 67:3% -001 
Highly 
Significant 


Verbal Tests 


FINAL TESTING 


Passed Attempted Percentage Chi? 
Passed 


Verbal Tests... 689 1580 43-6% 


12-71 
P=less than-001 
328 58-59%, Highly 
Significant 


Non-Verbal Tests 192 


(3) On both first and final testing there is a 


nificant difference between success rate on N 
Tests. E.S.N. 


up to ages 15 
10 V tests, but six out of ten P tests. 


consistent and highly PH 
on-Verbal tests and on Verba 
n V tests from ages 8 and 9 yen 
ng, they passed only four out 0 


} I that “ the increased verbal weighting at higher ag° 
levels is responsible for decreasing the final 1.0. ” supported ? 


tests. On final testing, however, out of a total of 1,908 sub-tests attempted, 
83 per cent. were V tests, and only 17 


P 
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statistically signi 

ally significant differ 7 

and on final if ifference on V tests between success i 

assume that her let us presume that there is no SO aes ae 
eer cant t ests the success rate is 40-8 per cent. (mean of 38 per cent ai 

assume ptt a aa fie rates achieved). Similarly for P tests we 
é actual Pa cee iene (mean of 67-3 per cent. and 58-5 per cent., 

: S next pres ; 

sixty-four ery me that on first testing 100 tests were attempted—of these 

success rates Just a tests and thirty-six P tests. Applying the differential 

Passed, and Fs i Ss culated, this would mean that twenty-six V tests would be 

£ tests would be passed. Total tests passed on first testing 


a thus be 48-6. 
` we then pres ; 
eighty-thr : presume that on final testing, 100 tests were attem; ted, of which 
Gel e would be V tests and seventeen would be P ee cooling the 
Passed, and tied rates again, this would result in thirty-four V tests being 
fore, be 44-7 -7 P tests. The total tests passed on final testing would, there- 
This. if a opposed to 48-6 on first testing. 
held constant 1e actual number of tests attempted on first and last testing was 
attempted i at 100, then considering the different proportion of V and P tests 
of subtest here would, on balance, be an apparent deterioration in the numbers 
rate) fron s passed, given no change at all in ‘ efficiency level ’ (i.e., in success- 
m one testing to the other. 
d appear that Clarke’s hypothesis is 


tee hypothetical case then, it woul 
ed, i.e., that the increased verbal weighting on final testing is responsible 


o : 
T decreasing the final score. 
the Rioees now consider the actual ‘ raw scores’ from the experiment, we find 
testing a Po for V tests and P tests combined, to be 48-5 per cent. on first 
1S not be nd 46-2 per cent. on final testing. Chi? of this difference is 0-93, which 
TE atistically significant. 
case. The percentages are similar to the proportions passed in the hypothetical 
he fact that in the real experiment the percentage of all sub-tests passed 
ical case, must be due to the 


on ; 

ee testing is higher than that in the hypothetica 

of all ified verbal efficiency ' on final testing. (See Table ). 83 percent. 

Ten? sub-tests on final testing were verbal and the improyed ‘ verbal 

ficie cy , on these outweighs the influence of the deteriorated ‘ non-verbal 
ney,’ on the remaining 17 per cent. of tests taken. 


Gag 
eneral discussion. 
with verbal sub-tests from Yr. VII level 


This ‘ over-weighting ' of T-M (L) 
Clarke and Clarke 1958 and Bull. 
nt investigation would tend 


n 
Brit es has been criticized elsewhere ( 
O su sychol. Soc. 1958). The results of the prese i te 

signif t these criticisms, though not at an acceptable Jevel of statistical 
Specie: The fact that T-M (L) “markedly penalizes ’ the dull person, 
aea if he is from ‘poor or adverse surroundings,’ would make it an 

altho able measuring instrument for use in the testing of ESN. children, since 
SE it ‘indicates his verbal handicaps ° effectively, it fails to give us an 

ound assessment of his wider intellectual capacities. 
e used throughout, 


Tn the present study, Roberts-Mellone corrections wer 
y since the publication of their 


a 
ong edure which has been confirmed empiric 
(1958 al paper (1952), following MacNemar (1942), by Collman and Newlyn 
ae by Hiskey and Sadnavitch (1958), by Scarr (1953) and by Portwood 
€ communication). Such evidence would appear to outweigh the argu- 
clear, neither are his basic assump- 


Men 
ti a Stott (1960), which are not always 
niversally acceptable. 
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tt’s findings (e.g., greater deterioration in the scores of girls 
He ee oe that ee ih aie deterioration which he finds may E 
due partially “ to real changes in the rate of mental development ” of ake 
children, yet it is notable that he concludes, “ this is not to deny that part o a 
variation is also due to faulty standardization.” Present results would sogge 4 
that the deterioration is partially due to the actual structure of the test, w te 
varies in its composition from one mental age level to another, and tends e 
lower the scores of those children with comparatively poor verbal ability; a 
direct proportion to the verbal weighting in the selection of sub-tests used a 
any one testing. . 
Stott’s (1960) paper fails to supply 


chronological ages of the children tested, about their mental ages, or about the 
average re-test interval for the childre 


n re-tested after 3 years 5 months. The 
present study offers data on a more ho; 
comparison between scores at junior scho 


adequate information about the 


ues remain to be considered. The work of the Clarkes 


, of Mundy (1957), and of Penrose (1949) strongly 
Suggests that for the sub-group we have been 
constancy of ‘ mental efficiency ’ (and thus of LO 
after the age of about 15 years, tk tked improvements in 1.0. 
Clarke's (1959) hypothesis is that such increments amongst the ‘ feeble-minded 
represent “ not a response to pre: itions, but rather a recovery from 
early deprivation,” maternal or environmental. This work by no means con- 
tradicts the well-known thesis of Bowlby (1951 and 1956) on deprived children, 
whose follow-up studies have, unfortunately, as yet all been on children below 
the age of 14 years ; it merely has a bearing on the long-term outcome. 
O'Connor (1956) points out that the concept of deprivation is still confused; 
but that one important variable in the normal development of abilities and 
8 environment.’ This p 


They found that amongst pre-school children 


and competitiveness (those from 
personality traits which resu 
Such children seem to develop 
tive for learning. 5 
- boys from ‘ working her 
Occupation groups, from which 71 re 
» Score consistently better on the 989) 
verbal tests. Bernstein (1958 and 19 a 
Messenger boys in adolescence from ‘ unskilled ae 
kille 206 scored Matrices T.Qs. of 1014+ while the mear“ 
of their Mill Hill Vocabulary scores fell below this. al 
: y Hollingworth (1920) and others that. in subnormiy 
children the curve of ‘ mental development ’ rises constantly, but more slow 


tion were derived) 


(L) than on the 
found that of 309 GEO 


<a 


= 


——————— o I 
i a -e 
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than in nor z 
nimal re , and that there is a cessation of growth at an earlier age than for 
hans linea? a If this is so, and the curve of mental development is ‘less 
support has b ie n the LQ. of subnormal children must inevitably fall. Some 
Phillips (19 K een given to this hypothesis by earlier investigators such as 
T-M Fa ‘hav h In more recent studies, however, where the varying S.Ds. of 
Coltiead gh corrected for, no such deterioration has been found, e.g., 
Weilavecnan t ewlyn (1958), Hiskey and Sadnavitch (1958) and Portwood. 
seen too in the work of the Clarkes that marked rises in I.Q. level may 


occur a a6 na . 5 
at adolescence ; and Davis (1957) has challenged the earlier assumption, 
ly important in influencing 


ant in determining the 
between the efficiency of 
ts. We know that they tend to come 
c ias , encouragement and 
oncern tend to be lacking, and where oth have pra children 
(Nisbet 1953, Anastasi 1956). Such 

1950), tend to do 

perc 3 sts’ whether they are of a performance or 
eptual type. Pringle and Bossio d evidence that groups 
Verbal than Performance W.LS.C. I.Qs. 

a residential special 

Verbal I.Qs. These 
at “ many of their 
t provide the best 


Writers suggest that 
nection, Bernstein's hypo- 


eager come from broken or unstable 1 
the itions for language developmen 
ses (1958 and 1959) offer a possi 

an In view of the recent appearance of Form L-M (Terman and Merrill 1960), 
whig Prasal was made of the relevance of the present study, to the new form, 
K ich seems likely to supercede Form L eventually. As in Appendix 1 of 
lige raser (1948), and as in the present study for T-M (L), the sub-tests of 
V e new Form L-M were similarly categorized as either ‘ Verbal’ or “ Non- 
erbal’ (which includes « Arithmetical ’), on the basis of the kind of test 


Tesponses elicited. 
Kennedy-Fraser (1948) reports for T-M (L) the following percentages of 
y Non-VerbalÌ ’ sub-tests (including Arithmetical), for M.A. groups Yrs. II—V, 
ts. VI—VII, and Yrs. VII—S.A.3, 59 per cent., 19 per cent. and 26 per cent., 
me Pectively. For the same three M.A. groups, 0n Form L-M, the percentages 
Non-Verbal ’ sub-tests were found to be 61 per cent., 16 per cent. and 30 per 
ee respectively. Thus, as far as the ‘ Vi erbal '—‘ Non-V erbal ’ dimension in 
ae composition of the new Form L-M is concerned, there would appear to be 
ward little difference with T-M (L), ‘and, in fact, many of the sub-tests are the 
ame, The conclusions reached in the present study would, therefore, appear 


to apply in general to Form L-M. 


D RECOMMENDATIONS. 
better on the Non-Verbal 
ne with earlier findings on 
and performance 
his may well 


V.—CONCLUSIONS AN 

th (1) E.S.N. boys between 7 and 16 years score | 
an on the Verbal sub-tests of T-M (L). This 1s n li 
a € relative efficiency of intellectually inferior groups on verbal 
Sts—E.S.N., deprived and delinquent children and adolescents. E 
related to the sociological findings that 
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i i i i from homes 
.S.N. boys in this Kent residential school tend to come r 
eee he is vesicle or semi-skilled, i.e., from the lowest two occupation 
groups; and from larger families of three or more children. 


is experi i i ive indications that 

3) On this experimental population, there are tentative ind ma t 
isin the mean ages of 94 years and 154 years, using T-M (L) tests, * v erba 
efficiency ’ improves somewhat, while ‘ non-verbal efficiency’ deteriorates a 
little. Despite these findings and despite the fact that Roberts Mellone correc- 


tions were used throughout, there was a statistically significant decline in I.Q. 
as measured by T-M (L). 


(4) This finding may be explained partially at least by Clarke’s hypothesis, 
that it is due to “the increased verbal weigh 


This is not to deny, 
may be operative in causing such a 


e of mental development,” due possibly 
deprivatory environment, or to lowered 
“mental organization and conceptualiza- 


to the cumulative effects of a culturally 
motivation, or to a lowered level of ‘ 
tion,” may also be important. 


) would appear to be a misleading test 
Although it is a predominantly 
al items passed improves slightly between 
9% and 154 years, yet its structure is such that the I.Q. appears to decline. 
i i ull child from a poor or detrimental 


environment ; it fails to give an all-round assessment of intellectual capacities ; 


and the different types of items are inconsistently and unequally distributed 
through the different mental age levels of the scale 


(6) In view of all the difficulties mention 
question would appear to need very 
educationally subnormal popul 
being considered. 


ed in this paper, the test under 
careful interpretation when applied to an 
ation, especially if changes in cognitive level were 


-linguistic abilities and disabilities of sub- 
l agnosis can lead to effective remedial treal 
ment. It is as yet too early to judge the effectiveness of such a differentia 


© question is considered by Cashdan (1962)-) 
(8) The problems discussed in 


it possible for some investigator to do a comparable test-retest study 0? 
children at the other ex i i 


— | 
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(9) There w 

Verbal’ would appear to be little difference in the ‘ gui 
oe op ere of T-M (L) (1937), and Form L-M (1960). a the PE 
asto Form L. recommendations would equally apply to the new Form L-M, 
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THE DEVELOPMENT OF CHILDREN’S IDEAS ABOUT 
COUNTRY AND NATIONALITY 


Parr II: NATIONAL SYMBOLS AND THEMES 


By GUSTAV JAHODA 
(University of Glasgow) 
— E The learning of conventional national symbols (national anthem, 
ngs, flags) and the understanding of their significance at various stages were 
d to explore more subtle 


oe A specially devised picture test was use 
eee of national consciousness. The broad developmental trends and the 
ocio-economic group differences emerging from both parts of the investigation 


are outlined, and some educational implications suggested. 


I.—INTRODUCTION. 


IN the literature of the social sciences a good deal of lip- 


importance of national symbols in the building up of national sentiments. 
te as follows: “As a rule, personal 


a prane, G. W. Allport (1950) wro 
oe ty can adhere to an abstraction only when the abstraction is richly symbol- 
ti d. Christianity rivets attention upo focus on their respec- 
tive flags.” Few attempts seem to hav quisition of 
such symbols in young children, apart from the work of Horowitz (1940) and 
recently Lawson (1963), who concentrated on national flags. The aim of the 
present study is to initiate the exploration of a somew 


symbols. 


service is paid to the 


ee II.—THE NATIONAL ANTHEM. É 
Six different tunes were played to the children on a tape-recorder in a 
were Scottish, two Irish, one English, 


Systematically rotated order. Two of them h 
and the National Anthem. They were asked to say of each if they had heard it 
before ; if recognition was claimed, the following questions were asked : Where 

untry does this tune 


ag you hear it? Do you know what it is? Which co c h 
elong to? Here only results relating to the National Anthem will be given. 


Table 1 shows the different kinds of responses. About one-quarter of the 
youngest children gave no sign of re nition. The second group said they had 


heard it before, but were either una y the country, or gave absurd 
answers like “ From the country where Herod ), or even 
It’s from Wagon Train.” 
TABLE 1 
ses TO HEARING Natl 


cog 


ble to specify 
is ” (note the tense 


ONAL ANTHEM. 


RESPON 


M Schools W Schools 
ee eee ——_ 
6-7 8-9 | 10-11 6-7 8-9 | 10-11 
Fails to recognize tune; --+-- jow 6 — — 7 1 z= 
Claims recognition but does not know z íl ê A 
_ country, or absurd response -.--+:: 13 4 2 
Claims recognition and attributes to 5 r A 
SeStlind  seajenuade cements, 1 4 2 4 
Eapana ng eyenes eet ee ; 
Ditto and attributes to Britain --++- 
24 
Per cent, unable to give title of tune ---- — 
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is point attention may be drawn to the bottom row of the table, 
E frequency of oe ele of the name. For this oer 
“God Save the Queen ” or “ National Anthem were accepted; o 2 s 
rejected, five implied awareness that the tune was linked with the eee 3 
such as “ The royal song ” or “ The Queen’s music.” Beyond the age m 3 
inability to name the tune was rare, and in general it was found more ofter 
among W children (P less than -02). 


Turning now to a consideration of err 
trends. The first, most common with W 


Scotland. The explanation which Suggests itself is that these children er 
unlikely to have attained a conception of Britain as a whole, and a check a 
G-stages supports this : 76 per cent. of them were at or below G.III, as compared 
with only 30 per cent. of those answering correctly (P less than -001). The causes 
of the second error, namely mentioning England or London, are less evident. 


In some cases at least, judging from the replies, it seems to have arisen from an 
association in the children’s minds between the Queen, her residence in London, 
and ‘her music.’ 


The pattern of development that emerges from the comments (though 
owing to limitations of time it was not possible to pursue it systematically) 
may at least be broadly sketched. At earlier ages, the National Anthem tends 
to be just one particular piece of music among others. The name is largely 
unknown, but it is connected with such occasions as school ceremonials, Church 


‘and Sunday School (e.g., ‘‘ We get it at concerts ” ; “ In the service in the hall— 
sist Tere hymn”) ; 


ors of attribution, one finds two major 
children, was to claim the anthem for 


the big ones know the wo 
wireless and, especially among W children 
at 8-9, but there were some additions. For M children, these consisted mainly 
of Cubs’ or Brownies’ displays, with V 

occasions like “ Away up the town wl 


Tt was on the T.V., when they stand up and sing. (Why do they stand 
up?) Don’t know, (W10). 

They play it at the start of an international game or something. It 
could be English, Scottish—anything. (W11) 
Thus, there is a slow and almost imperceptible transition in the meaning of 
this music for the children, At first it 


s may be quite unfamiliar or, if recognized, 
quite wrongly labelled. Later, it comes to be connected first with the person sa 
the Queen, and then with Britain as a whole; at that stage it was almos 

invariably correctly named. It was striking to observe how older children, wh? 


had already grasped this, responded to the questions with a contemptuous 
laugh, because they 


hey felt the answers were so obvious ; one would hardly hav? 
guessed that their insight is likely to have been a fairly recent acquisition 


III.—NATIONAL soncs 
Whilst the National Ant 


there are many popular tun 


* One-third of the 7-year- 
when it’s finished.” 


. + sacjonsS 
hem is confined to more formal or ritual orcas 
es tinged with patriotic sentiments Furtherm 


V- 
old W children were able to report that “ It’s on the T 


ee eel ie Se 
OS er. 

. ee a 

i 

—— 
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tend to be associated with 
ted Kingdom. The tracing 
ng the following question : 
places and are a long way 
which ones do 


by ” à 

ico with the National Anthem, such tunes 

z ae units which together constitute the Uni 

S opment in this sphere was the object of putti 

aa Peeples a this country go to live in far-away 
l Now imagi yw ik 

Fon thinks they would ine y would like to hear three songs : 


mie were divided into three categories. 
Ant SR give the name of any song, together wit 
s. For all these the question effectively meant “ 


three 5 iat ; 
songs.” The youngest children in this category cams out mainly with 
“ Jesus loves me,” “Away in a manger,” 


ae songs (‘‘ Gentle Jesus,” “J 

ana Siac ace age, irrelevant songs tended to become increasingly secular, 
Clans” ¢ any ergence between M and W emerged. “ I saw Mum kissing Santa 
opera ”) or ae are Rock ” versus “ Copelia ” or “An adult might like 
SA Also included in this category, because their number was relatively 
seutticd nai what might be called transitional answers. In these, the child 
bridge th have some inkling of what was required, but was as yet unable to 
fat fom hk gap imaginately by putting himself into the position of someone 
wild wi rei Examples of this would be children who suggested such people 
might vish to hear Italian, Indian or other foreign songs ; one boy said: “ We 
Pis R to France next year, so I think they should be French ones”; in 
E s terms, this would be a typically egocentric statement. All the responses 
ar are grouped under * 


The first consisted of those 
h those who produced irrelev- 
Tell me the names of 


‘Nil or irrelevant ” in Table 2. 


TABLE 2 


Sones FROM HOME. 


W Schools 


M Schools 
g-9 |10-11 6-7 8-9 |10-11 


14 4 


a or irrelevant response «+++ +2577" * 20 14 5 21 
nderstood and produced none or one .- 3 4 7 2 
1 6 12 1 


Understood and produced two or three.. 


who clearly understood 


d of children 
' or words to that 


s compose 3 
t to hear a Scottish song,’ 
ll, or were able to 


z Scottish song at a y 
-Scottish answers 


wh The second category wa 
x hen was required (“‘ They will wan 
ee a but ejther could not think of any 
Süd e only one. There was a small sprin.: C 

h as ‘ Carry me home to blighty,”’ which were, of course, cred 


category was the same, except that the children knew two or three songs.* 
of this ques t that it is one of the few 


ch failed to yield any significant difference and W. From this, it 
ay perhaps be in establishmen ciation 
i f the homeland is largely independent of 

ce and social background. It would seem that 
nd undifferentiated notion of, in 

link, Thus, none of the 


kling of correct non 
dited. The third 


CO: 
Peetu ly the presence of a 8 
case, Scotland is sufficient for the form: 
entioned included : Auld Lang Syne, I belong 
the gloamin’, Loch Lomond, A Gordon for me, 
nd braes, My heart's in the Highlands. 


Sau, Those most frequently m to Glasgow, 
Ty and the Brave, Roamin’ in Song of the 
e, A hundred pipers, Ye banks a 
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; iority of G.III over 
; derstood the problem ; however, the superiority TLL 
Ci DE not significant (P between -30 and -20). It is also likely Be 
children in both kinds of environments are about equally exposed to patrio 
songs, learning them at roughly the same rate. 


IV.—THE UNION JACK. 

Nine coloured cards picturing, among others, the Union Jack ane AoE 
Stars and Stripes, were spread out in front of the children. At the outset ne 
were told to point out those they had never seen before, and these were temon ar 
Next, they were asked the names of the remainder, and the countries to whi 


they belonged. Finally, children unable to comply were informed that some 


of the flags were Scottish, with the request that they should try and pick them 
out. The last move turned out 


to have been a mistake, hence, unfortunately, 

no useful material was obtained regarding the Lion Rampant and St. gs 
Cross.* The report on the findings will, therefore, be centred on the Union Jac l 
Results are set out in Table 3. First of all, there were two 6-year-old wW 
children who placed the Union Jack among those they had never seen : u 
are included in the top row. It will be noted that the children were genera > 
able to name the Union Jack correctly before attributing it to Britain as a whole 


rather than claiming it for Scotland only. The M children were significantly 
more successful on both counts (P less than -01). 


TABLE 3 


THE UNION Jack. 


M Schools W Schools 


8-9 | 10-11 


Does not know country or mistakes 
other than Scotland 

Calls it Scottish 

Correct 


Per cent. knowing the name of flag .... 


Number of childre 


n able to identify 


The youngest children were apt to link the flag with concrete situations i” 
T spontaneous comments, as shown below : 


It’s the flag you always put up when the Queen comes. (M6). Aunt 

That’s the one you get at the circus—my daddy buys it for me. (W n 

country ?) It’s the Glasgow flag. (W6) zE 

, The Union Jack was often called “ the Queen’s flag,” indicating an associ? i 

tion already seen in connection with the National Anthem. Beyond the age E 
grosser mistakes were mainly confined to W children ; apart from calling 


e of a pervasive ten 
re bound to score by 


thei 


* This was becaus 


à «u gcot- 
ish,’ dency to call the whole of the remainder Se 
tish,” whereby they we: 


chance alone. 
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variety ‘far; ee 
y of different flags “ Union Jack,” they sometimes used the expression in 


a completely ünromprelieadng manner : 
gm that’s the Jack flag. (What country ?) Don’t know. (W9). 

iig that s the Union Jack. (What country ?) Is it France? (W9) 
tine ont a to this, the bottom row of Table 3 shows that W children did 
gnificantly worse than M ones in attributing the Star and Stripes to the 


U.S.A i 

beini a Ri might be added that a good many more recognized the flag without 
able to specify the country. Here is a revealing instance : 

I’ve seen him on T.V. ; he fights the 


„That’s General Custard’s flag ; 
Indians. (W7). 
Seno a few impressionis 
ish emblems. They appeare 
influ’ the older children were able to name 
eno: of television again see’ 
rit it in connection with the Lion Rampant, 
hadi sh Television. Some of the M child: i 
that arnik, it with the Cubs or Brownies. On the 
symbol ithin the age-range studied, the Scottish emblem 
ols of Scottish nationality. 
: V.—RECOGNITION OF SCOTTISH PICTURES. 
ai This procedure was devised with the object of assessing some of the subtler 
spects of children’s knowledge of their country. They were presented with 
in Plates I-V, and require 


be offered regarding the 


d familiar to many children, though few even 
them correctly. The pervasive 
as several children 


tic comments will 


d to indicate for 


ro of three pictures, reproduced 1 
ach set which—if any—of the pictures is connected with Scotland. 


PLATE I—CostuMEs 


PLATE II —LanpscaPEs 


iy 


1 
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PLATE III — BUILDINGS 


S 
A 
k 
m 
m 
Z 
l 
> 
| on) 
m 
H 
< 
= 
= 
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Table sere f . 

number of Sem oe results in a somewhat different form, being confined to the 

possible chance sı es for each sub-group of children. No correction was made for 

ie reah ten successes, as 1t was evident from the children's comments that 

dealt with a: sae pasar The first set, mainly intended as an encouragement, 

With this. Wj a and costumes. The great majority of children had no trouble 
ith the landscape set, where a typical Scottish scene had to be 


picked 
out, W children were conspicuously less successful (P less than -001). 
travelling around 


The di 
ifference probably reflects differential opportunities for 
in the extent to which land- 


the ; 

= hea and possibly also differences ™ 

E y en neo their homes, though the latter is entirely speculative. Similar 
ations would apply to the next set, three buildings including Edinburgh 


Castl : 
e (P less than +02). Its noteworthy that several M children were able to 


identi - 3 
ify Notre Dame in Paris, one of the ‘ distractors.’ 


_ After thi : 

in ran this came a set of conventional emblems, and here the rates of success 
Significant] oni the ubiquitous Scottish thistle differed even less, though still 
y (P less than +05). The last set, featuring “ Rabbie ” Burns in 


com $ 
pany with Shakespeare and Voltaire, produced a surprise. There was no 
ly choosing Burns as the Scottish 


signifi ! 
as, a M-W difference, either in correct 
, or even in knowing his name. Evidently Burns constitutes one of the most 
as childhood. Below are some 
”. 


O . ; 
et Scottish national symbols, as early 
ing answers to the question “ How do you know this is Burns? 


My grannie’s got a picture of him. (M8). 
Because we have a calendar with his face on it. (MT). 
___When your mummie’s in hospital you £9 into a fish-and-chip shop for 
ties dinner ; and when I looked up I saw a picture of him looking at me. 
(W6). (N.B.—This boy recognized neither the National Anthem nor the 
Union Jack.) 

On the T.V. news—We $ 


ae It would seem that Burns’ presence 1n Sco 
social class, and that is no doubt how he would have W 


aw his cottage. (W7). 
i ttish life cuts across the barriers 


ished it to be. 


TABLE 4 


RECOGNITION oF SCOTTISH PICTURES. 
f 24 who succeeded on each task.) 


(Numbers of children out o 


M Schools W Schools 


SS 
6-7 8-9 | 10-11 


= 
8-9 10-11 


23 


soes on a ee 
ee Re EOE T 
... 


e A N 829 


ea OAF E EER 
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VI.—ATTITUDES TO OTHER COUNTRIES AND PEOPLES. 


s , A ; sewhere la 
The findings relating to this are reported in detail elsewhere (Jahoda, 
1962), and this section, therefore, Offers Merely a sketeh of bröad trends z 


: E, offers mi 
some features 6f BECA] IRteeet in thig COURLEY. i 
The children's growing awareness of mute distant hatlons besas with casui 
glimpses of unusual picturesque detail divorced from any general backgro ally 
This was sometimes combined with concrete and homely, though equi m 
fragmentary information gained through such personal channels as relatives 


; -AIia 
$ ifferentiati and organization, too com 
abroad. he subsequent Process of differentiation and GA i Pa 
plex for brief Summary, was studied in some detail in relation to America a 
Russia, 


ally, these are probably 10) 
e nentioned only hegar 
an early mode of thought. Apart from the attractior 
avourable attitudes seeme : 
lief in the similarity of oa 
ing of affinity ; this last statement, it sioa a 
ve and could not be directly verified from a 
from a surprisingly early age, have their root: 


Idest or most mature children these are over- 
shadowed by present political conflicts, 


evanescent to deserve th 
they are characteristic of 
of * holiday countries,’ th 
to be again the existence 
people to oneself, resulting in a feel 
be added, is somewhat speculati 
data. Unfavourable attitudes, 


the form of colour prejudice, and a 


ok down on people in other countries. It is interesting t° 
note, however, that their attitude to the English appeared to be influenced by 
considerations of sor ial class Thus, a higher Proportion of W as « ompared with 
children thought the English ‘nicer’ in behaviour (e.g., “In London the 
men are not so rowdy when they're drunk"); moreover, they tended A 
identify an English accent with middle class speech, and to regard the Scottish 
tongue as a working class one, as may be illustrated by two examples : 
Well, the English Say “ Good jolly, this is a jolly fine day.” The 
Scottish say “ Aye, it's a grand day.” The English are a lot of toffs. (W9) 
They talk different—kind of polite, kind of Cockney . . , (W11). 
Lastly, a few general remarks on attitudes towards the English. From un 
way the children talked about them (and it must be remembered that w 
largely reflect the expres ini f thei i 
traditional antagonis 


disparaging remarks about the ‘ 


There were hardly any 
dislikes, not a single 


» and when questioned abou 
the English. 


fe 
. š . n’s 
gains from surveying the childre 


, ; ; her 
3 sions of investigators concerned with ot 
areas of children’s thinking, j i 


ue: Y Wiener ae 8 


ee - -— ae 


i 
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them; some fi 
tie > five years lat 1 i 
mentally different: s later the outlook of most of them is no longer funda- 
they ally different from that of mature adults, During the intervening years, 
ane eed a git of conceptual tools enabling them to organize their 
JUL! goiiel. $ and to make the main distinction conventional in 
ever, tt w details ol this process have been described in this presdill study. Hews 
Canine asi a essury to treat the varic spects i d s iq isolat 
running a the question as to „iha arigua napadle ee ied 
Pe through the advances in seemingly divergent spheres. An analysis 
probable ty MUR EHA material as a whole indicates that the central element is 
entiating bart e ound in the extent to which the child has succeeded in difier= 
and thie a ordering ever-widening sections of his geographical environment ; 
appear ert a at least roughly assessed in terms of the G-stages. It would 
at the particular G-stage reached is an index of the limit to the amount 


Of progress possi : 

nT po pees elsewhere ; children often lag behind, failing to realize their 

achievement, but there is no indication that they can exceed it in 
arrot recital. 


enui 5 
Am understanding, as contrasted with mere p: 
advie S, the youngest children investigated, nearly nine-tenths had not 
T Ben beyond GIL. It is thus legitimate to generalize about the age-group 
diversit ass their outlook was fairly uniform. After the age of 7, the increasing 
; sity of G-stages precludes any general statement, which would be mislead- 


ing, 
denotes a perspective confined to the child’s 
school and play streets in which they spen 


It may be recalled that G.I. 
nsion has taken place, involving some 


immedi 
masiat surroundings—home, 
E G life. With G.II an exte: ; 
and of ni of Glasgow as a unitary whole, to which the child refers himself 
Deen fi i ich he feels himself to be a part. As yet, however, no conception has 

ormed about what lies outside. From this starting-point, an inventory of 


he idea : 3 > 
he eas characterising a majority of the children aged 6-7 will be made. 
itst of all, the meaning of even the simplest geographical words tends to 
“ig liable to be applie 


gether mistaken. Thus ‘ town 
not only inadequate 


be ey ty, 
$ extremely vague, or altog ben A r 
any aggregate streets upwards. This reflects.. nades 
eee or Li but at the same time also inability t9 
both are really 


eor 4 
classi of this particular conce) 
sify known geographical names in any coherent order ; 
our country?” the dominant 
y, I live in a 


two si e 
ancy tes of the same coin. sked “ what is your co 
doses ae “ Glasgow”; one replied “ I dont live in a countr 
reali; Whilst they have heard the word “ Scotland, and if G.I usually 
it ah that it is a place of some kind, they cannot locate it properly in space ; 
rita hought of as being ““ away ”” somewhere. Requests to say something about 
except elicited a collection of absurdities. Roe they achieved, we nen 
s ions, was to put forward notions 1MpP ying recognition at Britain 1 
, a kind of pane ial entity, though invariably a remote one. lca 
ni ols relating to Britain as a whole, such as the National ae or ae 
failed Jack, were at best attributed to Scotland ; a majority of children in G. 
to recognize them altogether. ro. : à 
and When asked whether they were British, a majority of the children nGa 
ing zL replied negatively ; the same answer was given by nearly all chi me 
tho -I to the question as to whether they were Scottish ; but two-thirds © 
ee in G.II affirmed that they were Scottish. In view of the evident confusion 
valent even among the latter, this result may appear surprising : why should 


but extends to the common stairs of a 


* 
tence strictly an entrance OF passage, 
nt, or the building as a whole- 
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: 5 = a rt of 
eady to acknowledge being Scottish, when they clearly fall shor 
O the relationship between Glasgow and Scotland? In pan a 
explain this, one has to abandon the exclusive concern with purely cog hee 
aspects. These children are growing up in a symbolic environment an el 
part of their environment consisting of words, signs, etc.) where oe a 
things Scottish are bound to figure rather prominently. Hence, even be i 
they are capable of building up a coherent conceptual framework Se 
categorize their geographical surroundings, they will become familiar with a 5 
words as “‘ Scotland ” and “ Scottish,” as well as to a lesser extent pee ‘4 
and “ British.” Moreover, some broad and loose associations, normally with 4 
favourable emotional tinge, will tend to be established between these words ang 
certain items of their experience. This was often apparent in earlier pages : 
children were reminded by the word “ Scotland ” of their family, of Highlan® 
Dancers, and so forth. An accumulation of such associations leads to a neadiny 
to acknowledge themselves as “‘ Scottish ” ; but until such time as an re en 
concept of “ Scotland ” and “ Scottish ” has crystalized, such quasi-identificé 
tions remain unstable and apt to be withdrawn when challenged. . : 
The purpose of labouring this point at some length is to show again 1o 
young children are sometimes able to use these words in a formally CoR 
manner, when more careful probing would reveal that conceptual grasp $ s 
eludes, them. In fact, during the first two or three years of their school caree!, 
most children have hardly begun to make sense of the wider geographica; 
Social and political setting around them. It will not have escaped the readers 
attention that it was necessary to qualify statements about the younges 


children, because even they were far from uniform. Thus, one 6-year-old was 
already in G.IV, whilst at the oth 


er end of the scale, two 10-year-olds had not 
moved beyond G.II. This variability of individuals cannot be stressed t00 
strongly, because it is in danger of being overlooked when one thinks in terms 0 
age trends, and this applies most of all to the intermediate ages, A majority 0 
the youngest children are likely to be in G.I or G.II, and most of the oldest E 
G.III or G.IV, but those aged 8-9 are distributed over the whole range. a 
holds true for children from M and W schools taken separately, as well as fo 
the total sample. 
In general, the children from M schools 


advance of those from W schools ; however, when M and W children wet 
matched for intelligence tes 


i t score, the difference disappeared. It was, therefo ee 
concluded that this factor accounts for a large part of both individual variatio 
and group differences. 


were found to be significantly 1" 


VIII.—SOME EDUCATION. 

“He that hath n 
speaks only empty sounds.” It would be entire 
add that the passage from na 
by fully elaborated concept 


nts meaning in his words, a 
ly in the spirit of John Loc ked 
mes as more or less empty sounds to names bac zest 
s is the outcome of a gradual learning P100 e 
or not a child who has a E nat 
y readily be led to assume ig 
e within the reach of the child. One m is 
example. : “ The relation also that things on over 
and distances is very obvious to observe ; as aP? j 


to 


A such statements, along the lines sketched 1 
present study, will indicate t 


n 
: s ro 

À hat to a child such relationships may be far f 
obvious, 
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What beari 2 
aring does this have on the teaching situation? The simple answer 


would be that tk 
Capacity for arises teacher should not jump too ‘dren’ 
ahendeton adequate. comprehension ; but this is cee ee ap en 
conceptual deve prasi the teacher could not hope to ascertain the stage of 
some compromi opment reached by each child, and even if this could be done. 
sideration of og would be unavoidable. More effective action would be a recon- 
a exanivation ae of teaching such subjects as Geography and History. 
keeping in mi Hy the Schemes of Work for teaching these in primary schools 
ind the findings reported, will reveal the presence of numerous 


assumpti S 
ge OETA ical pan are likely to be invalid; for instance, what kinds of ‘ simple 
rms ’ are regarded as having to be taught, and which ones seem 


t > 

£ Ee tacitly taken for granted ? 

Bee pose suggestion is that some account be take: 

concept devel what appears to the dominant, if not universal sequence of 

out, say, a Sead gs from the immediate vicinity outwards. This would rule 

or map L . p from plans of the school room and adjacent streets to the globe 
e world. The main aim would be a consolidation of the series 


Wideni 
he Sa from the home town to Britain as a whole. 
from the y, it is important to guard against a possible misconception, arising 
child mu pee discussion of developmental stages. It has been stated that a 
st attain a certain level of intellectual maturity before he is ready to 
he must be capable of performing the 


dling spatial relationships. 


n in planning the 


Most childr uch a view presupposes that 
i ren more or less automatically advance to the limits of their poten- 
the truth. Even in the course 


tialiti : 
ities, and this is, unfortunately, very 
unintended co: 
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THE SELF CONCEPTS OF TRAINEE-TEACHERS IN TWO 
SUB-CULTURES 


By A. S. PHILLIPS . 
(Department of Education, University of the West Indies) 


Summary. (1) The aim of this study was to compare the Self Concepts ol 

training college students from two different sub-cultures, viz., Englan 

maica. ` 

Ja (2) Unstructured compositions by students from both countries, pa be 

topic—“ I myself: the person I am; the person others think I am i ae 

person I would like to be,” were analysed to reveal the categories studen ee 
in describing themselves. Some interesting qualitative differences appeal 
though students from both countries used the same categories in their 
description. aaae CO 
(3) Objective tests for measuring the Self Concept of students were me 
structed and administered to other students in both countries. The follow 
comparisons between the two groups emerged : wall 
la) Jamaican students, both male and female, of the regular age range as ad 
as the mature, rated themselves significantly higher on the Cognized see 
the Ideal Selves, and were more Self Accepting than their English coun 
parts ; 

(b) English male students rated themselves at 
as English female students, 
students tended to rate them: 
Self Accepting. of 

(c) There were no singificant differences between the Self Concept scores ah 
‘regular’ English and ‘mature’ English students. But the ‘ matur 


5 : poi the 
Jamaican students cognized themselves significantly lower than th 
‘regular ’ Jamaican students. 


el 
approximately the same ies 
But, in the Jamaican samples, the dere 
selves higher than the men, and to be 


I.—Intropuction. 
WITHIN recent years the Self Concept of the individu 
and more attention from psychologists. It has come 
part of personality structure, exercising a profound 
happiness. According to Lecky4 one 


al has been receiving mor 
to be regarded as the pin 
effect on behaviour and PA 

of the functions of the Self is to produce s 
harmonious integrated personality. This is achieved largely by an accep a 2 
of oneself as one is, not in a sense of complacency, but with a full knowle E~ 
of the things about oneself one cannot change, and of the endowments one ha 
to work on. Such understanding leads to mental health. Besides, our genni 
behaviour is to a large extent, regulated and organized by-what we perce! 
ourselves to be. 


If the individual, ably happy, normal, alr 
adjusted life, free from fears and tensions, from feelings of constant self-reproa 
and utter inadequacies, it is fairly clear that he should i 


education. Staines3 h 


4 P 3 5 inar. 
Self Concepts of children, without any lowering of achievement in the ordina i 
school subjects, by merel 


, y adapting the methods of instruction Iı the 
Concepts of children are 


0 
‘ of such vital importance, then the Self Concepts 
trainee teachers must be of 
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II.—THE JAMAICAN SO 

: The Se 7 J CIETY. 
might be e s students from the Jamaican and English sub-cultures 
iene ael o iffer widely as the social forces in these two countries 
Many races com = Pes: has been an English colony for over 300 years. 
and the negro F, e population, but the two dominant ones are the white 
brought teat mp the beginning of its colonial history, these two races were 
white peo ye her with very clearly defined places in the social hierarchy. The 
position ve gunea all the property and all the wealth and were thus ina 
slaves athont hority and prestige. The negroes were, in the beginning, merely 
came. secur d rights—the chattel of their masters. Emancipation when it 

DE colon Sa the physical freedom of the slaves, but the pattern continued. 
SOME tno m people were made to feel inferior in so many ways, some subtle, 
Stirs ra brutal, that it was virtually impossible for them to doubt their 
did aie fo. was strengthened by the processes of education, when education 
the Unit bas coloured children. Its pattern was copied almost entirely from 
United F = Smee The syllabus of the schools was that designed for the 
tions and ie om. The standards and the ideals were theirs. Even the illustra- 
Englan d On pictures presented to children were those prepared for children in 
in an ill y nly in rare circumstances would a child see a negroid type depicted 
hacks ustration, and even then it was usually conforming to the stereotype of 
angen native.’ Even the language they were taught was full of expressions 

tie ing black with bad or undesirable, and white with good or desirable. 
Shay image held up to the child then, particularly in relation to physical 
racteristics, would be that of the European, and the European type would 


Certainly form the Ideal Self picture. 
ofhi T his probably explains the coloured person’s rejection of the negroid part 
and ; ancestry (there has been much intermixture between the races in Jamaica) 
the BaF desire to escape from it, as typified by such practices as straightening 
hair and using bleaching creams to i Ti 
Ta Though the more obvious forms of discrimination no longer obtain 1n 
h maica and there is fairly gener: all races, never- 
sear the attitudes still persis ed to produce Self 
secre full of unworthiness and inadequacy and marked by excessive rejection 
t ‘Self. There should, therefore, be marked con Self Concepts of 
raining college students from England, where no such forces appear to operate. 
D COMPOSITIONS. 
nts in England and Jamaica 
the topic: “E myself: the 
I would like to be.” 


IIJ.—SoOME UNSTRUCTURE 


iy As a first step, groups of training college stude 
ere asked to write unstructured compositions on 
person I am ; the person others think I am, and the person ; 
th ese three sub-divisions of the main topic orresponding to the 
ree levels of the Self, viz., the Cognized Self, the Other Self, and the Ideal 
pee The aim was to discover jes that students use In reporting on 
t 1e Self Concept at these three levels, a ( ative evidence as 
© the differences in the way students untries cognize themselves. 
ihe: There was evidence in the essays ents were quite honest in giving 
‘heir estimates of themselves, perhaps because they had been assured of 
anonymity. Remarks such as the following were qui! 
* Tf this were to be read by any ne in this college it woul 


impossible for me to write this.” 


d be quite 


or 
ut I shall try to be 


“J find it difficult for me to describe myself, bi 
perfectly honest.” 


156 Self Concepts in Two Sub-Cultures 


There was some evidence too, that the exercise tended to make the students 
somewhat uncomfortable, e.g. : 


“Tam, at the moment, not liking putting down all these thoughts 


about myself, because it injures my pride to see myself in such a bad light 
or 


“ This hurts, but this is the truth.” 
An analysis of the essays showed that students in both sub-cultures used hae 
or less the same categories in defining themselves. These categories, ae a 
percentages from each group that gave replies in each category (for the leve 


the Cognized Self) are set out in Table 1.* 
TABLE 1 


, E 
TIONED ITEMS IN VARIOUS CATEGORIES AT TH 
THE COGNIZED SELF. 


PERCENTAGE oF STUDENTS WHO MEN 
LEVEL oF 


English Jamaican 


Regular Mature Regular Mature 
M eo Me | oe | ope F | M F 
N—| 60 


- Physical Character- 
istics 


Categories 


== 25 100 


. Philosophical 
Category ......., 
- Performance 

Category ........ 
. Social Relationships. . 
- Status and Role .. = 
Vants, Goals and 
Aspirations, 
- Aesthetic Life k 
+ Miscellaneous 


S © yone P 


BZS 


ere 
ao) 


e differences in the way in which students in each grouP 
l 


Certain qualitativ he 
described themselves also emerged. For instance, there was a tendency for T 
English students, especially the male students, to denigrate themselves. 
example, one man wrote : 


. I make a hell of a lot of noise ; 
lie too much, but a 


; CIB rie nroa 
tie: 
ich po warm-hearted person (and) have some of the finest quali 
of which human beings are capable.” 
* For further details, see Phi 
Groups of Training College S; 


ted 
illips, ASA- Study of the Seif Concepts of Select 
Training Situation.” Unpubl 


; che! 
students in their relation to other variables in the Tea‘ 
ished Ph.D, Thesis, London University. 
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“Tam 
. ++ + person of more than avera, bili i 
ina 2 ge ability. I am 

Certain es dealings, thus causing my character to ba beyond ES 
shows the hi h ies were evident among the various groups, however. Table 1 
of ‘ eon: percentage in all groups that gave responses in the categories 
his is in A and Character,’ ‘ Performance,’ and ‘Social Relationships.’ 
selves in term ri ea with Jersild’s? claim that mature people tend to see them- 
frequenc “ee Se inner forces rather than of external qualities. The generally low 
Dress Pi G Tesponses in such categories as ‘ Material Possessions,’ ‘-Clothes 
rooming,’ ‘ Status and Role,’ etc., applied to all groups in both sub- 


cultures, 
M : 
uch greater emphasis was placed by both groups on the Cognized Self, 
elf.* This may have resulted merely 


Tat 

fon a ioe the other two levels of the Sı 
far fuller aet that the Cognized Self appeared first in the essays and so received 
difficulty į remibut, though many students admitted that they had great 
ly amo = i nowing what other people thought of them. Also, some, particular- 
g the Jamaicans, claimed that they would not wish to be any different 


from what they were. 


T= 
V.—CONSTRUCTION OF AN OBJECTIVE TEST FOR MEASURING THE 


SELF CONCEPT. 
uantifiable measures of the Self 


tructed. This was divided into 


In order to secure more objective and q 
and the Ideal Self. It was 


C 
re a test of the self rating type was cons 
parts measuring respectively the Cognized Self 


deci : : i 
Cided to investigate only those categories at these two levels of the Self, as 
udents in their essays. Twelve categories 


W 
y actually reported on by the st 
sufficient investigated, as one, viz., * Animals and Pets’ did not appear to be of 
analysis universality to warrant inclusion. | Later, as a result of statistical 
eleven s, another category, that of ‘ Aesthetic Life’ was also omitted, leaving 
ee eee in the final form of the test. 
stude e individual items in the test consisted of statements made by the 
were E in their essays. The main criteria for selection of these statements 
count hat they had to be applicable to people of different races, living 1n different 
Posi tries, to both sexes, and to people of varying ages. Also, items had to be 
sitively framed, that is, indicating possession of a desirable quality. Students 
m 0-4) as far as each item 


yee asked to rate themselves on a five-point scale (from S 
gr applicable to them. This gave a score for each item, and as items were 
Ouped into categories, a score for each category. A summation of all items 
for the level of the Cognized Self. For the level of 
these same items, not 


as a total Self Concept score 
asth deal Self, students were asked to rate themselves on e ite 
bet ey thought they were, but as they would like to be. The algebraic difference 
a eet the scores on the Cognized and the Ideal Selves (the discrepancy score) 

e a measure of Self Acceptance. 

gori An unequal number of items was selected for each category as some cate- 
ries were regarded as of wider conception than others. After the first trial 
gether, covering more or 


Tun, it w a S 
i as apparent that certain categories hung to; € 
less the nr tet. o s Tt was, therefore, decided to group 

correlating 


Cifosacins f the Self Concept. é , 
kr gories into areas. Evidence for such grouping Was provided by: 
Wares for various categories with each other. Next, the items forming an area 

ere tested for evidence 0 g to the Guttman technique, and 


thi f scalability accordin 
1S revealed that they On this basis, five clear areas 


th did, in fact, form a scale. 
1) oo Concept were revealed, Viz. : 
Ossessions and Desires. 


* See Phillips, A. S., loc. cit. 
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(2) Personality, Character and Beliefs. 
(3) Social Aspect. 
) Performance Aspect. 
(5) External Appearance. . 
All further comparisons were done in these areas. 


The reliability of the test was determined by a re-test after an po a 
three weeks, when a product-moment correlation coefficient was calculated. 


This gave an overall figure of +--87 for the Cognized Self and -+--80 for the 
Ideal Self. This was regarded as adequate.* 


V.—COoMPARISON OF STUDENTS FROM ENGLISH AND JAMAICAN vai press | 
Table 2 gives the numbers and average age of students in each i a 
‘ Regular ’ students were taken as those between 18 and 24 years ; ý stomps 
students were taken to mean those over 25 years. Unfortunately, it was 1 


possible to obtain large enough samples of mature male students in either 
country. 


TABLE 2 
NUMBERS AND AVERAGE AGE oF SAMPLES. 


Samples Numbers Average Age 


Regular English Female 58 19 years 4 months 
Regular Jamaican Female |.. 


ae 61 20 years 4 months 
Regular English Male .... sie 51 20 years 8 months 
Regular Jamaican Male i 47 20 years 11 months 
Mature English Female |. ik 11 36 years 4 months 
Mature Jamaican Female 98 30 years 5 months 


In comparing scores for trainin 


g college students from both sub-cultures, 
only total self concept scores are gi 


ven here.t These are set out in Table 3. 
TABLE 3 


MEAN SELF CONCEPT Scores. 


Jamaican English 


M 


COGNIZED SELF 
Regular Female 
Regular Male 
Mature Female 


IDEAL SELF 
Regular Female 
Regular Male 
Mature Female 


SELF ACCEPTANCE 
Regular Female 
Regular Male 
Mature Female 


* For complete test, sce Phillips, A, S., loc. cit. 


} For full comparison, see Phillips, A. S., loc. cit. 


A. S. PHILLIPS 159 


It ha , 

less aa A ep that the Jamaican students would cognize themselves 
self acceptin y, that is lower than, their English counterparts, and be less 
Jamaican eid as have a higher discrepancy score). It was revealed that the 
well as the mat nts, both male and female, those of the regular age range, as 
Coenized asw ie female students, rated themselves significantly higher on the 
(ies, they Poe ASON the Ideal Self. They also had a lower discrepancy score 
significance a Ph agrees though this difference does not reach 
areas of the selves. ature female group. These trends are generally true for all 
over ee sete to explain the higher rating achieved by the Jamaican student 
and racial a = counterpart. Perhaps it has its roots in the awakening national 
this period mai evident in Jamaica over the last twenty years. During 
d has come a more advanced political constitution for the island, and a 

fficial colour 


widenj Ste A 
ening of opportunities for its dark skinned inhabitants. An uno’ 
has disappeared. Generally speaking, the 


b 5 
pe Nae operated prior to 1940, 
J amaican acquired a new pride in himself. But this does not explain why the 
much hi eae. even the much older ones, should cognize themselves SO 
nother er than (instead of more nearly equal to) the English students. 
forces } possible explanation 1s that defence mechanisms are at work. Many 
i have been trying to impress on the Jamaican, his unworthiness and 
this. So he 


in ! 
adequacy. But he cannot live with a self picture as unlovely as 
sing of himself as much better than he actually is. 


st reassurance by conceiv tter, 
an commenting on the operation of the mechanism in the Self Concept, 
need a that the need for self deception e of a more fundament: 
maintain or restore self esteem. 
the differing experiences 


of gg chars, koyeuen the explanation 1 

po ul. respective groups. The Jamaican samples were dr m level of the 
Ake ation which, by and large, did not have many opportunities for getting 
ad and achieving success. But these particular ones had achieved greater 
ves as more i igent, of better 


success than their peers. Thus, they saw themsel 1 
On the other hand, their 
ar schools who had gone 


ae able to achieve more, etc. 

on = were less successful than their peers from gramm ] 
not o university or entered other more highly esteemed professions. 

noted, too, that though the Jamaican samples had a higher Ideal Self, they were, 


in fact, more Self Acceptant. 


arises becaus 


TABLE 4 
F CONCEPT. 


DIFFERENCES IN THE SEL 


SEX 


Cocnizep SELF 
Jemaican—Regular aoe 
nglish—Regular -++ 


ao SELF 
maican—Regular 
English Regular reese 


J SELF ACCEPTANCE 
de aican- Regnar rse 
nglish—Regular ..-.-- 
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n expected that there would be no difference in the scores of men 
and pe hi fhe. Self Concept scales at either levels of the Self or ka pe 
Acceptance in either sub-culture. . It was found that this was true, fo a 
English samples, but not for the Jamaican samples. For the Jamaican sanp 
though the difference in the total scores of men and women did no me 
significance, women cognized themselves significantly higher than men on pe 
of the five areas making up this total score. On the Ideal Self women 
significantly higher than men, and they were less Self Acceptant. ; a 

It-is possible that the explanation of the difference in Self cone 2i 
Jamaican men and women lies in the fact that the Jamaican society is aoe 
what matriarchal in structure, though in certain respects it is patriarchal o: Hee 
Victorian flavour.* Women very often have to assume the role of both sah 
and father in the family. They are very often the mainstay and general provi ne 

. It is not surprising, therefore, that they should conceive of themselves in hig! ue 
terms than their male counterparts. But perhaps, they are not satisfied Ae 
playing this role. This might account for their being less Self Acceptant th A 
men. No similar pressures appear to be operating in the English population. 


TABLE 5 


AGE AND THE SELF CONCEPT. 


Mature 


COGNIZED SELF 
Jamaican Female 
English Female 


IDEAL SELF 


Jamaican Female .. 
English Female . . 


SELF ACCEPTANCE 
Jamaican Female 
English Female .... 


It was believed that by the time people have reached adult status, the S elf 
Concept would have b 


very little in Succeeding years. It was, therefore, hypothesised that there 
would be no significant differences 

‘regular’ and ‘ mature’ training 
found that this was true of the 


* See Kerr, M.3 


fFor full exposition, see Phillips, A. S., loc. cit. 
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VALUE OF THE ORAL RESPONSE IN BEGINNING 
READING: AN EXPERIMENTAL STUDY USING 
PROGRAMMED INSTRUCTION* 


By JOHN D. McNEIL anp EVAN R. KEISLAR 
(University of California, Los Angeles.) 


Summary. The question of the effect of vocaliz 


‘ation in reading has not received 
a clear answer. In the present study, the proc 


edures of programed instruction 
saying of words and sentences ou 
loud by beginners i i i i 


not oral reading, was given at the end of the | 
the hypothesis th i 


; words out loud in learning to recognize 
and understand written words and Sentences? In order to collect evidence 


and actions, 


The specific hypothesis of the Present study was that kindergarten childre? 
taught by Programed i i i i 


This rese: 


r arch project was 
Education, De a ay 


£ 
nae 
orted by a grant from the United States Offic 

‘partment of Health, Pee ee s 


Education, and Welfare (Grant No. 1413). 
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oo II.—DEsIGN OF THE EXPERIMENT. 
Program, to ee of the study called for presenting an auto-instructional 
children ins isting of seventeen daily lessons, to a number of kindergarten 
halt of ee proB half of whom were required to make oral responses and 
Measured by Sa not. At the end of the last lesson, the extent of learning was 
constituted. : pa The presentation of the program to the small groups 

replications of the experiment, providing evidence of reliability of 


tł 
ne treatment and its effects. 


Subjects, 


The childr i 
Bailie: en used in the study h 
nd were the total kindergarten pf 


ad an average chronological age of 5 years 
schools, O lations at three selected public elementary 
donimuntie © school represented an upper-middle class native-born white 
predominately one school was located in a lower-middle class community, 
Uam dale y of Japanese and Mexican and a third was located 
Cities, e class community typical Southern California 


backgrounds ; 
of those in most 


ment, each child was given an intelligence 
vhich measured his knowledge of letters, 
configurati t d ability to recognize likeness in letter 
Scores ze aons Children from the sa n were paired on the basis of 
memb obtained from these tests and also upon their record of absenteeism. One 
meek of each pair, randomly selected, was assigne , the other 
er to a non-oral group. From these pairs thirteen © 


Oral gri See 
groups consisting of ten or less were constructed. 


There were 231 children in the original sample. The subjects who were 


d 
Topped from the study were : forty-three children absent more j 
ble to speak English and two children 
i + disturbed). _ The 


e study. These 
from 72-138 


JII.— METHOD. 
essons which took 
de up of approxi 


Dy, 

Programed Materials. ai 
ted of seventeen dal y 

med materials, 


sli we instructional program consis 
mat tly over three weeks to administer. ach lesson was ma! 
mately thirty-five questions OF frames. Two sets of the progra 3 
ed. The non-oral set camia 
MENE 


identical į 
in all respects but one, were prepar" 
aped commentary wich directed the learner to ‘look at the word(s) ” ; 
“ say the word(s).”” In order to assure that 
i terials, all subjects 


o J ri 
eMag set directed the learner to ` $a} 
aes both groups were attending to the reading materials aie 
e e required to respond to problems throughout the program ; for each prob- 
m, children were to select one of three choices offered by pressing an appro- 
Priate button. The oral group, therefore, made poth oral and button-pressing 
€sponses ; the non-oral group made only the putton-pressing response. 
ram both as independent words 


Fort z 
a y words were tau. ht during the pro 1 
and as sentences. These ue ted terms commonly found in the oral 
: i.e., and, with) 


Vocal ds represen n 
af bulary of young children’ and included both function words ( 
int those with a direct referent (i.e mother, girl). The printed words taught 
Ass study were among those used in everyday speech by .children. The 
ly was not concerned with extending the child’s vocabulary. 
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(2 


(3 


(4 


(5 


(6 


2 


the first less 


(2) 
(b) 


Oral Response in Beginning Reading 


Examples of the 700 frames in the program are shown below : 


Purpose. 
To introduce a new 
word. 


To facilitate reading 
from left to right and 
to emphasize relations 
of a printed word to a 
pattern of language 
structure. 

To maintain motiva- 
tion as well as to test 
for comprehension. 


To help the learner 
identify sentences 
which are appropriate 
answers to the quest- 
ions he hears. 

To help learners rely 
on more than one cue 
in identifying words. 
(e.g., length instead of 
initial letter or con- 
figuration alone). 

To give practice in 
Tecognizing and com- 
prehending function 
words, 


The program aimed to deve 


on, efforts were made 
to new combinations of 


meaning, (2) to identify t 
answer to a posed questio: 
among words, and ( 
objectives, the following pri 


In order to get the le: 


Frame. 
Children 
children woman dog 


children 


TUM) sanesa 
fast girl boy 
the boy and girl can run 


picture picture picture 
A B C 


Billy Billy we go 
can eat runs fast 


children cold chair 
children ....., run 
we fast can 


ad from left to right. 


nciples of program prepa 
arner to read from 


lop four genaralizable skills. 
to teach children (1) 
printed words, arriving at 
he particular word in a sentence which 
n, (3) to respond toa v. 
4) to re 


Commentary. 

“ Children. There are 
children in your room at 
school. The word at the 
top says children. (Say 
the word.) (Look at the 
word.) Which word bez 
low also says children ! 

“ (Say) (Look at) me 
words at the top. W 
word below sounds righi 
with the top words: 


“ (Say) (Look at) the 
words at the top and fin 5 
the picture that 80° 
with the words.” do 
“What does Billy ¢° 
when he is in a hurry ‘ 


“ Which word says child- 
ten?” 
“Find the missing 
word.” 


Beginning, with 
to respond appropria ge 
a complete thought n 
provides 


; eget atin, 
ariety of cues in discriminating 


four 


introduced on the right of previously learned words (e.g., Frame (2) above! 
Pictures were rarely used in the 
were prompted orally by the ta 


rds & 


at or say the wo’ h the 


picture from among three that ‘“ went wit 


e 
F 3 z d th 
as built around reiteration of the words a no 
word and sentence recognition. There VE gat 
tructure. Words and sentences were reviev 


generalizable skills of 
narrative or thematic s 
unexpected times. 
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The ove: : 
al) was 33 rall difficulty level of the 
program (pe 
P er formanen T yor ior ST sral group and 38 ERS ASE 
ach oie 7 reflected a range in difficult f : 
23 per sone p bp The average difficulty of the daily Yeon es ae 
per cent. errors (oral group) and 23 per cent. to 51 per beat 


(non-oral group). 


The =| partas: 
ne a P 
Each sg provided for instruction of ten or fewer children at a time 
Phones throuEl uan individual laboratory-type cubicle equipped with head- 
a microphone a hich he heard the taped commentary- The cubicle also contained 
consisted of thre well as response and confirmation panels. The response panel 
the mul tplechoi buttons by which the child indicated his selection in answering 
was by lights hoice problems in reading. Confirmation of the correct answer 
response, and ; a green light was shown when the learner made the correct 
A ao iy ar one when he was in error. 

cubicle. On oe x 8-ft. screen at the front of the room was visible from each 
strip, while si his screen, learners saw each daily sequence presented by film- 
program adv multaneously hearing the accompanying taped commentary. The 
for respondi anced automatically at a paced schedule with 3-seconds allowed 
responses i ris to each frame. ‘As each individual in the oral group made vocal 
of the indivi ? his microphone, he heard his own voice. Frequent monitoring 

at SDE ual oral responses through the microphone circuit gave evidence 
used the e s were responding orally as directed. When the non-oral group 
individual’ quipment, the microphones were deactivated. A record of each 
ofa s multiple-choice performance for each frame was obtained by means 

punched paper tape. 
(i.e. ee appara therefore, combined advantages of both group teaching 
tion (ie omy, control of word exposure time and pacing) and tutorial instruc- 
vision for personal communication without peer group distraction and pro- 
or individual response and confirmation). 


kowe criterion test provided evidence that the learner had acquired 
ge of the words and reading skills previously mentioned. This post- 
-choice items, each with three or four altern- 

pt that the items 


test ? 

consisted of fifty-one multiple: 

o the frames in the program exce 
i Instructions for each 


one item per page. 
:dentical to any frame 1m 

nt of alternatives ha not been 
i t the learner 


Seen P e 
n by the pupil. tems required tha 

e a given word. The re he ame to see the 

to an ora! question, or 


indivi 
ividual word or sentence which supp ed 
ibed a picture- 


an in 
complete sentence, Or which best descri 
as follows : Wie ts 
est Item. 


Examples of the test items are 
He Fe oe geet Instruction. 
Word ects an individual “ Find the word hat (a) boy, dog, toy- 
ere it appears with says dog.” (b) children, tree, dog. 
aai a (9 like and (b) 
ay e shape. 
will Ket the word that “Eind the missing 
Never omplete a sentence 
seen before. 


t 
e program; the con 


a dog can play «+++: 


ball 


word.” 
red, with, no, little 
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The test was administered in group situations, twelve children at a time, 
oral and non-oral together. 


IV.—RESULTS AND CONCLUSIONS. 
(1) Overall Effect of Oral Response. 


The mean score for the oral group on the criterion test was 30-3 (SD= 10-0) 
while the non-oral mean was 23-8 (SD=7-9). This superiority on the part 0 
the oral responders was significant when based on the average difference 
between matched pairs (t=5-64, p< -01). Since the mean score for forky-seved 
uninstructed kindergarten children was only 12-7 (SD=3-3), both the oral an 
non-oral forms of the program appear to be effective in teaching. 


The superiority of the oral response group is illustrated in Figure 1 wiers 
the results are presented for the three groups in terms of percentage of tota 
possible scores. Since there was no correction for guessing, the dotted line 


indicates a chance score which would have been obtained by blind guessing: 
Using this line as a base, it may be noted that the effect of the oral response 15 
considerable. 


60% 
Post-test 50% 
Score 
Percentage 
of Total 
Possible 40% 
30% 
20% 
Chance 


Oral 


Non-Oral Uninstructed 
Control 


FIGURE 1 


COMPARATIVE CONTRIBUTION OF ORAL AND Non-O 


ING 
RAL RESPONSE Mope In REAP 
ACHIEVEMENT. 


ing non-oral group. On the basis of a sign test, this difference is statistica” 
significant (p < -01). 


(2) Value of Oral Responding at Different Levels of Ability. 
Although it was 


jall 
: was speculated that the oral response would be espec!# A 
important for children of lower intelligence, the interaction effects were 
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rest tatisti However, the direction f th 
s suggest the desirability of further inquiry regarding the special sade 


ages = 
of oral responding by those of lower intelligence. Table 1 below shows the 


mea: 
n scores by three levels of ability for both oral and non-oral responders. 


TABLE 1 
L AND Non-OrAL SUBJECTS. 


la: 
tge enough to be statistically significant. 


POST-TEST RESULTS FOR OR“ 


Non-Oral 
Written Post-test 


Oral 
Written Post-test 


Low I.0, N 

ow LLC, (N=60) ... > M25- Z7 

fea hare ee aie wel 
M=30:3 


M=33-7 


MIERLO. (N= 60) ps ani ane oranes EE 


M=23'8 


Total N= gas unde seers: M=30°3 


Children to Want to Read. 


udinal effect of the prost 
child. Children from both oral 


3 
(3) Value of Oral Response in M otivating 
am and the 


oral 2 order to throw light upon the attit 
and Ppa an interview was held with each 
conti on-oral groups were individually asked 
Vik “sat with the program when given eac 

Cer a to records ; (2) listen to a story ; 

to on per cent. of the children both grou 
a en more of these other activities; 10 per cen e 
thee a a to all four of the other alternatives. Parenthetically, we can say 
ent} 1e introduction of a formal reading program was received W 

husiasm as currently popular activities in the kindergarten. 


me ee achievement in learning Was. 
Tes rence for continuing the reading activity, the data sugge 
abi ee might have motivational properties í 
nA (I.Q. below 100). Preferential scores for oral and non-oral groups in 
è le and high I.Q. levels averaged 1:5, but preferential scores - 
vel were 1-6 and -97 for the oral and non-oral groups, respectively. 


whether they would prefer to 
ing as an alternative : 


facilitates the 


The result 
s show that or i 
mi i f printed words and sentences. Children who 
rior on a test 


frente a Peer mene tructional program were supe 
S ora! i uto-instruc F 
Which called ia rg ication of a variety, of reading skills in contexts different 
ae those taught in the program itself. The oral respon ee a 
a ue even though the post-test called for silent reading, not oral respo 8) 
and none of the items of the post-test appeared in the training program. 
nism involved in the 


As indi ier, one way to ex Jain the mechanism | { 
Sreater iidicated Fh n5 y k is that the distinct stimulus of 


„t vocalization 1 
a f the obtained effects. 
belief that oral responding in 


(4) Conclusion. 


Sayj : 
E words aloud is require 
he experiment gives no St ort for the 
ga, mo ole effects On the contrary, we must conclude 
Jearner are often 


learn; 
hat, per to read has unfav 
associa though vocalization and 
Teade: ated, the act of vocalization 
Voc a in what for him is already & 
alization, his performance would be worse. 


aid to the beginning 
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T 
HE INFLUENCE OF EARLY PUBERTY ON TEST 
PERFORMANCE AT AGE ELEVEN 


BY 
J. D. NISBET 
(Department of Education, U: niversity 
AND 
R. ILLESLEY 
(Obstetric Medicine Research Unit, University of Aberdeen) 


of Aberdeen) 


at the earlier onset of puberty among girls 
tests at the age of 11 plus. This question 


Te Samiaa by the analysis of test scores at ages 7, 9, 11 and 13 from a 
fathers oh of girls numbering 1,385, for whom age at first menstruation and 
slightl s occupation were k turers proved to have a 
a ightly higher average test they were superior at all 
ges and not only at 11. While the results at age 13 do not rule out the 
Possibility of a later pubertal spurt, it can be concluded that early onset of 

puberty does not confer any advantage in selection tests at age 11. 
I.— INTRODUCTION. 


Tu . f 
E evidence for the secular trend towards early maturity has 


Te 
Fecently by Tanner (1961): for example o 
whereas the average age at menarche (or first menstruation) was 174 years, 
stea aily to-day it is about 13 years. The trend appears to have De: tai d 
Th ily over the whole of the last hundred years, and it is still continuing. 
ed times, menarche seems d around the age of 14. 
x el eid Juliet was 13: he devotes ten lines | 
ee that she is just “ a fortnight and odd days h 
i day.) The factors influencing the age of puberty remaul 


in F; 2 
adequate records and differences in mortality T 


a AN trend has aroused conce: 
earlie: ing procedures of selection for secondary eae Gaal 
ace le, than boys ; and if there 1s 4 spurt in mental growth similar to e 
scceleration in physical growth at puberty, then as puberty occurs earte 
burt will be beginning to affect the performance of early maturing girls in 
i dvantage of an early growth spurt is 
anner ( d reduce the yaioty o th ee eee 
ner (1962) has recently re orted that early maturing A ý 
mng nate on average co election tests than the late maturers. Howey, 
s finding does not allow for the possibility that the early maturers may have 
t all ages over the late maturers. ik i 
Other indi ‘dence would appear to be consistent wit the hypothesis 
that scores at ea ea beginning ioe affected by the earlier onset of puberty, 
ough here too other explanations of the findings are possible. The Scottish 
ental Survey (1949), in which a test given to all the 11-year-old children in 
cotland in 1932, was given again to the 11-year-old age-group 10 1947, showed 
sat the average score in 1947 icher than in 1932. In 1932, the boys were 
irls had not only caught 


Sli 
ghtly better on average than the gi 


S 
Da It has been suggested th 
ay now be affecting performance in 


been summarized 


169 
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ji ints hi the 
d passed the boys, and scored on average two points higher maa t 
gaat (1950) in a similar comparison of age-groups in ten rele 
areas over the period 1936-1947, also found that the average score of ae er 
increased relative to that of boys, though there was some indication t Te 
girls’ superiority had begun to fall away by the age of 13. Boyne and 


(1959) found a similar increased superiority of girls over boys among Aberdeen 
school children over the period 1947-1957. 


These findings would appear to be in accord with the hypothesis ee 
spurt in mental growth accompanying puberty is affecting performance 


Thus, if there is a slow general improvement in over-all performance (whether 


: ; 9 s 
due to coaching or any other cause), this would also result in a trend toward 


girls’ superiority as the whole age-group moved up into a type of item which 
tended to favour the girls. - 


For an adequate test of the hypothesis, a longitudinal study is necessary: 
An age-group must be tested at interval 


it can be safely assumed that puberty 
up to at least age 13 (by which age i 


puberty, if it exists, will have begun to show an effect). Comparison of the 


nd the remainder provides a test of the 
ntal growth accompanying puberty, oe 
berty is affecting performance in tests a 


hypothesis that there is a spurt in me 
will show whether earlier onset of pu 
age 11. 


II.—Susyjects, TESTS AND OTHER DATA, 


7 ‘ scond 
as those girls who were in the ae. 
hat is, around the age of 13 or ove 


9: 
was given. For this year-group, 
to identify the earlier scores of t 


the whole age-group at ages 7, 9 and 11 


ê- 
some 6 per cent. of the go 
e two schools had taken nd, 
ssed the earlier tests. In the & 
complete test records from age 7 were traced for 85 per cent. of those teste 


5 jo 
towards the lower occupat 


9, 

The tests used were: at age 7, Moray House Picture Test; at a 
Schonell’s Essential Intelligence Test ; at age 11, Moray House Verbal Ke 
ing Tests 57 and 58 ; and at age 13, Moray House Advanced Verbal Reasoning 
Test 10. Not all the sample sat the earlier tests at the same time: thus, > test 
11, the large majority sat Moray House Tests 57 and 58, but a few sat the 


nal 
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applied t Ä z 
ER year, Tests 55 and 56. This difference had to be ignored : 
Tests are Site provide for differences in age at testing, and the Moray House 
All the test nani to be, and generally are, comparable with one another. 
Means and st E a were re-standardized on this sample at each age, to equate the 
test, thus eli andard deviations and to produce normal distributions in each 
Dland chs oot ee differences in standardization between tests. Means of 
ndard deviations of 15 were imposed in a conversion based on scores 


grouped in inte. i i p 

ae eh of five points : interpolations within intervals resulted in 

h worked out slightly below 100 and standard deviations below 15. 
f the original scores and of the scores 


he ens 
Pia cone and standard deviations 0 
onversion are set out in Table 1. 


TABLE | 


MEAN as 
S AND STANDARD DEVIATIONS BEFORE AND AFTER RE-STANDARDIZATION. 


Revised Scores 


Original Scores 
s.d. 


Test at Age 
— Mean s.d. Mean 
106-85 15-33 99°77 14-66 
109-70 15-53 99-66 14-81 
104-95 13-68 99-82 14-82 
106-41 13-44 99-66 14-69 
107-90 13-93 


tion = next step was to fin each girl. The investiga- 
an india restricted to girls so that the age at first menst d be used as 
about + ation of whether pu ` Since the sample was age’ 
‘eis 3 or over, menarche W 
nda ni pee not necessary 
onsiderable proportion ha 
gel oe to separate the early d z , 
ee Existing medical records did not include age at menarche ; 
girls ol Health Service in the city un c 
ferr in the sample to ask for this information. 5 
Ree that the information should be got by writing to the mo 1e gir 
kad eee done and practically all replied. Difficulties at this stage of the inquiry 
ad been anticipated and particular attention was given to the method of 
ved, and information 


approach. No one was scandalized, no protests were rece 
age at menarche was obtained for 98-8 per cent. of those tested at age 13: 


i Information on social class W o carefully designed 
questions on father’s occupation. Vn this basis, pupils were graded into SIX 
Sroups on a modified version of the Registrar-Genera I and II, 


Professional a veS IJa non-manua III skilled 
manual pal poi : ae 1 orkers; and V unskilled workers. 


hen a pupil’s description of the father’s occupation was too vague to be 
ed, through the headmaster, for further information. 


Classified < 
Mi: s aske : 
eae (father’s occupation) ” was permitted, to allow for 


er special cases. Only 2:3 per cent. of the total 
i of the six categories and it was possible 


8roup could i radi 

not be given a ading in one 4 : 
ii establish from i ei records of the Aberdeen Maternity Hospital 
lat a considerable proportion of these were, in fact, illegitimate children. 


to rely on distant and possibly 
È but it was still 
from the 

but the 


killed manual w 
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III.—ANALYSIS OF RESULTS. 


i s a 3 ] on 
lete test records—that is, scores on tests at ages 7, 9 and 13, and 

two ee age 11—were available for 1,402 girls. Information on menat i 
was lacking for seventeen of these, leaving a total of 1,385, from ee 
following results are derived. The age range at the time of interview nn me 
six pupils who were under 13 years, and six over 14 years 6 manes, Dr ane 
cent. of ages at interview were between 13 years 3 months and 14 years 2 ec 
The median age at menarche in the sample was 13 years 2 months. yari ‘ove 
tion began before the age of 11 in twenty-four cases, under 2 per cent. ; j y 
age 12 in 165, or 12 per cent. ; and before age 13 in 40 per cent. of the group. 
Age at menarche was classified in three categories : 


(i) The very early maturers, with menarche before the age of 12 years 3 
months. 


(ii) The early maturers, with menarche between the ages of 12 years 3 months 
and 12 years 11 months. 


(iii) The remainder, among whom menarche occurred at or after the ag" at 
13 years, or who, being already aged 13 years, had not yet reac A 
menarche. (There were two girls not yet at menarche who were age 


between 12 years 9 months and 12 years 11 months, and these were 
included in this third group). 


This third group, therefore, included both the 
maturers. It was not practicable to distinguish thes 
a group of 358 girls among whom menarche occurred after the age of 13 yen 
9 months. The mean scores of this group proved to be about five points 9 
I.Q. below the average, but this was a spurious result arising from the nature 
of the sample. Where a sample is drawn from an educational year-group, there 
is a tendency for the brighter pupils to be younger than average and the duller 
pupils to be older. Younger pupils who were to prove late developers had ns 
reached menarche at the time of interview. An attempt to establish a category 
of definite late developers from the available data would have missed these 
younger pupils and resulted in a group with a bias towards older age and lowe? 


ability. All pupils who were neither early nor very early maturers were, there- 
fore, grouped together. 


median group and the 
e. There was, in the sample., 


The same objection does not apply to the ‘early’ and ‘very early 


maturers,’ with whom menarche occurred before age 13 years or before 12 yeas 
3 months, respectively, as these groups could be identified with certainty. 
It is thus possible to derive results which would test the hypothesis that th 
early maturers would show a spurt in mental growth around the onset k 
puberty, but not to obtain information on the effect with late maturers. It he 
however, the early maturers who are concerned in the argument about t 


influence of puberty on performance in selection tests at the age of 11. THS 
hypothesis under test 


g to affect performance in testing at age $? 


$ vement, that is, relative y inder 4 
they may previously have shown over the early maturers and the remainde 


3; 
the age of 7 and 9 before any effect of puberty has begun to operate. By age x 
on this hypothesis, the lead e 


to 
have been reduced, as the ea: e 
benefit likewise from this pubertal spurt. 
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The mea: 
ae 
scores of the very early and early maturers and the remainder, 


which are s i 
a t 
et out in Table 2, do not show this pattern of results. The very early 


Maturers are c ; - 
re certainly superior at the age of 11, but this is not a sudden emerg- 
ost the same degree 


ence of T fs 
head. eee Ta with the onset of puberty. They are alm 
s 7 and 9. In fact, the pattern of mean scores between ages 7 an 


11 is, if a A 
> ny 
ything, the reverse of what would be expected on the hypothesis of a 
Differences between the 


pubertal spur 

Means = among very early maturers at age 11. 

(the very e canons pe any one‘age are generally significant 

nly a pariy ata ages over bot ‘early’ : i 2 y f 
y at ages 7 and 13 over ‘ remainder ’). ny ane remand ae san 


TABLE 2 


Mean Test SCORES OF MENARCHE GROUPS. 


Menarche 


Mean T — 
ean Test Score at Age _ 
Very Early 


Remainder 


7 103-5 

9 102:8 99-4 

11 (1) 102-9 99-9 98-9 
11 (2) 102-6 99-8 98-7 
13 103-3 100-2 


Percent: ay 
age enteri i iS 
ary Schools ne Pe gezond 


h the results at age 13, there is some suggestion 
The differences 


ahead of the remainder. 
y are not significant 


ny one menarche categor: 
the drop in mean score of the * remainder 
int is that the 


that Bevond the age of 11, with 
he early maturers are pushing 


etw 
until w n means at various ages for a 
cate i reach age 13: at this age, 
es is significant. An alternative expression of this point 1 a 
increa: e between the means of the ‘ very early ’ and ‘remainder ’ categories 
ses significantly between 1] and 13. This may mark the beginning of a 
h without a further 


pu 
bertal spurt among the very early maturers, thoug 
ils have already left 


testi 
ing at age 15 (now impracticable as many I 
establish such a finding. However, it € 


sc} 
Pediat data are inadequate to c J 
is not ed from the present data that the earlier onset of puberty among girls 

seriously affecting test performance at age 11. 

isa It should, perhaps, be emphasized that within each menarche group there 
Cate, wide range of test ‘scores at each age, the very early, early and remainder 
gories all including pupils whose scores Tall e to the lowest. 
Tans relation of social class (assessed by 
is, of che and test score at various ages can be ex: ‘ 
fa age an association between social class and test score : the average 
girls from Social Class I was between 114 and 117 on all the tests, and the 
between 93 and 95. Again, there 


a 
PD score of those from Social Class V was 2 
maty wide range of scores in each category: The superiority of the very early 
Gites. cannot, however; be explained as merely the product of class differ- 
aböve Even within any one social class, the very early maturers are consistently 
ve the average for that class. The mean scores for each menarche category, 
are set out in full in Table 3. 


Separati 
Parating the social class group: 
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TABLE 3 
MEAN TEST SCORE AND MENARCHE, By Sociat CLASS. 


-== Mean Score at Age— 
Social Class | Menarche 


Category 7 9 11 (1) 11 (2) 


I/II VE 115 116 114 115 
E 107 107 109 109 
108 107 108 107 


< 
wo ow 


107 106 105 
105 102 105 
97 101 103 
101 99 100 


101 99 
98 98 


< 
(zi 


a 


103 102 (22) 
94 96 (44) 
97 95 96 (98) 


97 97 ‘ (28) 
97 93 ‘ ‘ ‘ (43) 
93 94 ‘ 2 92 | (107) 


Social class not stated : VE4; E5; R23 (32) 


< 


E 
R 
iE 
E 
R 
JE 
E 
R 


1385 


Sub-division by social class reduces the numb 
the table, resulting in slightly larger ch 


Some association between social 
have been expected. When the investi 
logically-minded among our advisers w 


and proportion in each menarche category 


Í there is no marked association between age at menar ers 
and social class in this sample. The proportion of very early and early matur 
is nearly the same among daughters of professional and business men as 2™ 
the daughters of unskilled labourers, 
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TABLE4 
DISTRIBUTION OF MENARCHE BY Soctat CLASS. 


Menarche 
Category Social Class 
I/I . Totals 
/ Il IV v 


Very carly 5 5 = 
ii 50 (16) 119 (18) 24 (13) 31 (16) | 273 (17) 


as (26) | 46 (24) | 336 1) 
110 (60) | 117 (60) | 955 (61) 


= 
Pals 306 (100) 6 3 
51 (100) | 182 (100) | 194 (100) |1564 (100) 


Early z 
67 (22) 131 (20) 


Remainde 
er. +| 189 (62) 404 (62) 


a (Percentages in brackets.) 
ni-square ië : n 
(Pabout 0.4). on this table is 8-6, with 8 degrees of freedom, and is not significant 
The i 

inter-correlations S 
follows - nter-correlations between the test scores at the various ages were as 


Betwee 
etween score at age 7 and score at age 9: +0-69 


age 11 (1): 40:72 
age 11 (2): +0-75 
age 13: +0:74 
+0-82 


eat age 11 (1): 


Between score at age 9 and scor 
age 11 (2): +0-82 


7" age 13: +0-82 
ween score at age 11 (1) and score at age 11 (2): +0:93 
age 13: +0-90 
40-91 


Between score at age 11 (2) and score at age 13's 


[V.—CONCLUSIONS- 
sis of the test scores of 1,385 girls at ages 7, 9, 
i t of puberty iS affecting 


The results from this analy 
jety that earl 
is unfounded. On other aspects of the 
rmit any definite 


1] 

bein 13, show that the anx 

effect net in selection tests at age Ty 

conclusi puberty on mental development, the data do not pé 1y Gent 
as ions to be reached. The results at age 13 can be interpreted as indicating 

bebana ee of a possible later spurt. js also possible (some may Say 

Which e) that the onse 

conce: are not measure 

tned principally wit 


Wit] 
h the remainder, and th 


Matu 
ar ete Whatever the causes ©” 
es not show any relationship Detwe. 
i mily size, migration, type 


Class 
of oe ee aspects of this investigation, 5 
iscus: ndary school attended, area ° d birth weight, will be 
sed in a subsequent papê"- 


h a comparison of t 
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THE MEASUREMENT OF LEARNING ABILITY 


By G W. S. MACKAY ax P. E. VERNON 
(University of London, I nstitute of Education) 


SUMMARY.— 2 

learn, aey eo tests are often assumed to predict a child's ability to 
kamaa A ais are seldom based on situations that directly involve any 
eath Bese ES p ery of nine group tests was devised for junior school children, 
e aN 1. he learning of an un nd yielding six scores for 
along with ages of practice. These w 8-9 and 10-11 year pupils, 
added app ami mber of more conventio! der to find whether they 
TE PR preon to the predictions of ived from the previous 
holding aes As a measure of learning, ed, i.e., final scores 
These i le eestor 
al gain measur 
eels though somewhat more 
peal However, the tests differe: 
findin might produce more useful ones. 
abilte tasks relevant to school progress which wi! 
iste ti A stronger case might be made for using fina 

ion and practice on new tests, as predictors. 


].— INTRODUCTION. 
intelligence tests to childr 


implies their capacity to make scl 

s that the tests predict ability to learn ; and, indeed, several writers have 
ility (Buckingham 1921, Dearborn 1921). 

following instruction 


ly state that im- 


familiar task, a 
ere applied to 
nal tests, in or 
achievement der 
residual gain was us 


es were found to be largely specific in the younger 
consistent and more predictive in the older 
d considerably in their validities, and further 
There are considerable difficulties in 
hich will also cover a wide range of 
l scores, after a period of 


en in school, or at an 
holastic progress. his 


rall (1947), who adds that 
t are associate 
h thing as generalized learning ability. 
asured by tests which are given once only, 1 
e acquisition of new mental skills. 
amount of progress children are 
iod, say @ school year. And 
ic achievement. 
i parent 


fen other tests; i.e. 
ay ipa is normally me 

enc e no opportunity for peat T k 
š e, they bear very little relati 


lik 
ely to make in their school work over a 
hly with future OV 
T i lanation of this ap) 

i child 


ted out the 
Starts evelopmen i € mulative. 
tarts the year with a fund of information and mental skills, X., which are very 
acauj i i intelligence test, 7) during the year he 
quires further skills, Y. Now i ith final achievement, 
S Y, is still quite large x is large although the correlation Try 
y Ne very small or even Zero. 
Surely, it should be ossible, then, to devise j 
[farming a a controlled akon, and which might add to the predictions of 
Soar r performance already covered by intelligence tests. Such tests should be of 
Sho table value to the psychologist and/or the teacher. Furthermore, they 
o ma, provide even better opportunities than current intelligence tests for 
att erving and assessing children’s methods of tackling difficulties, their 
€ntion, perseverance, etc —qualities which are highly relevant to future 


SUCI 
cess or failure. 


tests which actually involve 


177 
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II.—Previous INVESTIGATIONS. 


‘ P y aie ae 
iderable amount of work on the factors involved in various ty pes 2 
pe ec been carried out, though little of it has much bearing on ae 
achievement. Anastasi (1930, 1932), Thurstone (1938) and others, hav ec a 
strated the existence of rote memory factors in the learning of paired = et 
words, digits, etc. ; for the most part, these seem to be fairly specific to t oe 
of material and the conditions of presentation and recall. There is no evic a 
that they correlate with school learning, except in so far as they involve ES] 
verbal ability. Vernon (1961) claims, in his review of the field, that log E 
memory or the learning of meaningful materials depends wholly on thes 


familiar factors, i.e., that no special faculty of meaningful learning can be 
distinguished. 


success during the early stages of learning. A similar investigation of learning ê 


blished by Burroughs and Morris (1962) 
Allison (1954, 1956) and Mollenkopf ( 
selecting U.S. Navy personnel, designed perf 


1957), as part of a programme gor 
ormance tests involving the learning 
subjects were instructed in assembling variou 
models, and tested after each 


period of instruction. They found that these 
learning scores predicted achievement at the end of mechanical training @ 


least as well as the N avy’s Basic Battery of aptitude tests, and that they eh 
i redictive efficiency of this Battery- 


3 a +. materials 
carried out with scholastic mete Bi 


The most quoted studies 
(1938, 1939), who practised twi 
simple tasks 

ere also the 1 
and there was little evidence of any general factor among the gain scores, nv 3 
overlapping between the amount of improvement and intelligence. Ney fi y 
c types of task, e.g., numerical, perceptual, di jrow 5 
Some positive correlation with NV and P factors. One reason for Wood! 


s: 2: 
€ l that the tasks employed were of a rather iba- 
kind, and Tilton (1953) claims considerably greater consistency in an invé eight 
tion of school subject learning. Here, standardized achievement tests 12 
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subjects wi , 
jects were given to fourth grade pupils, and repeated twenty months later. 


There y tae 

for Atenia ached pesky correlations, averaging 0-30 (01 0:51 when corrected 

: ation) between the gain scores, indicating a prominent general factor 
with initial achievement ; 


Mm sc i P: 

ie, hon) progress: Gains also correlated positively 

th: ho were more able in any subject tended to progress better in it 

an those who were initially weak. ! ee 
have been reported from the 


i ee ae two large-scale investigations 

learnin te Pg Service, Princeton, N.J. Stake (1958) applied a dozen 
Both B bal s, each involving several trials, to 240 seventh grade pupils. 
erbal and non-verbal, rote and meaningful, tasks were used, and an 


extensiv. 3 
sive battery of reference tests was added. Each learning task was scored 
f the learning curves, and for 


fi 
i rror ‘OF overall success, for slope © l i 
AEri of progress. The last set of measures yielded mostly negligible 
with oie. bot the success measures correlated positively with intelligence and 
cluded Th astic achievement. On factorizing his complete battery, Stake con- 
different ih there is no general learning ability, but a number of factors for 
learni kinds of tests ; also that there is litt tween slope or rate of 
ng and familiar psychclogical factors. 
reater variety of 


Allison’s (1960) study was similar, but it included a g 
1, rote learning and concept formation), 


oe tasks (psychomotor, mechanical, T 

pro were given to 315 adult recruits. These were scored for average rate of 
fas and for faster or slower progress 1n the second than 1n the first half of 
in 2 The main factors among these measures included : (1) Conceptual learn- 
thee (2) Mechanical and motor learning ; and (3) Rote learning. On comparing 
She e with the factors established in a battery of thirty-nine reference tests, he 
nu erved, unlike Stake, that No. 1 was related to general verbal intelligence and 
fa mber ability, No. 2 to a spatial-mechanical factor, and No. 3 toa rote memory 

ctor. 

Tai would seem then that learning progress is certainly not 1 
karea ee though there may be s 5 
abi ing tasks are studied. There is little evi¢ 
of Aty; common to all types of task, though this 
mie ee peeks of the tasks. Rather, learning measures te 
c mber of rather ill-defined factors, according to the type 9 
rapa of learning and reca 1. Some of these group factors 
© ability factors, for example verbal Jearning correlating more 
verbal ability, non-verbal learning with spatial ability, etc- 


le relation be 


JJ. —THE PRESENT EXPERIMENT. 

.,, Our own small-scale experiment was conducted against this background 
with two groups of approximately thirty-six primary school children, aged 8- ) 
years and 10-11 years, being the complete unstreamed second and fourth years 
of a school in a good ‘middle-class area. A study with these numbers can be 
regarded only as a first pilot exploration ; put it may suggest Ways of tackling 
a larger sample with more refined techniques if wor 
m The aim was to devise a series of group learning S 
i ate more closely to the normal school learning situati 

Yy Stake. Group tests were preferred, not only because th 
evitably be too lengthy for individual application, put also beca 
Might well learn differently when individually instructed from the way they 

earn in class. Each tes Jeted in a school period ; i.e., long-term 


earnj t was. comp’ r e 
ming and retention were not involved. All tests were given during January 
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; PEE -entional 
in order to find whether the learning tests added significantly to conven n 
tests in predicting achievement. The final criterion of achievement consiste 


experience and achievement. Another problem i ream 
cover a wide enough range of ability to be useful throughout the junior sc a a, 
Ideally, a dull 8 or eyen 7-year-old should be able to make a start with taer 
before he receives Practice ; while at the same time a bright | I-year-old shoy 
still be improving at the end of practice. A description of the tests will show ho’ 
these difficulties were tackled. Details may be found in Mackay (1962). 


(1) Language Learning Test. This test attempted to teach the children a 
new language that they could translate from or into. Esperanto was use 

cture, and because it seemed unlikely that a 

it. The test was a printed one and was tack = 

it became clear that after about 40 minutes, 


ised ; they were designed to reduce anxiety by giving the impression that per 
was a normal lesson, not a formal test. But once the nature of the task had bee 


pointed out, the children carried on by themselves from printed instructions. 


English words with their Esperanto 

article were prefixed with La, those with 
the indefinite article merely had the equivalent without La. This exercise 
i grasp this rule, and they had to use it in translating 
d forward to the next piece of informa 


anslated using the rules governing the 


(2) Number Substitution Learning. This test was similar in structure to the 
Language Learning. 


A simple form of algebra unfamiliar to the children WaS 
used. Letters were given equivalent numb 


complex way without assuming a knowledge 


s 
(3) Verbal Instructions Learning. This test was given six trials, as W@ 
every subsequent test. The chi 


each sheet containing eight ro 
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4) Ve 3 
less met ees mation Learning. The above test involved rather meaning- 
Meet at school gh presented a kind of learning more similar to what children 
simple, but u i Te wenty-five pieces of information were read out; these were 
the first five ET ey to be known by any of them: €g. Bach died in 1750; 
of the compl a ters on an English typewriter are QWERT. After a first reading 
on the E ete list, the childrer answered twenty-five ques 
(not of r oe Six trials were given, and the order of as! 
eading the statements) was ‘altered each time. 
letter words was read 


out 1O W: o Memory. A list of sixteen three or four- 
they e T ildren, and they were given two minutes to write down as many as 
d remember. (In practice they were allowed as much time as they 


ne : 
eded). This was repeated six times. 
The abcve tests covered the kind of 


ERRA 7 orcepinak Detail Learning. 
work. Thi he children learn in the verbal and number aspects of their school 
visual aes E and the two following attempted to see how they learn using 
representing he children were briefly shown a large card with twelve circles 
eyes, no Ing, faces Each face was incomplete in some detail of eyebrows, 
shown = and mouth. No other features were shown, and those that were 
looked sed only dots or straight lines ; no drawing skill was involved. Having 
taini at the twelve faces for eight seconds, they then completed a sheet con- 
ing twelve blank circles. This test was repeated six times. 
plex (7) Geometrical Form Learning. This test presented a large, rather com- 
ape geometrical figure, built up from squares, circles, triangles, diagonals, 
the, twenty distinct points in all. After looking at the figure for eight seconds, 

e children attempted to draw this figure. Again, six trials were given. 

IA (8) Symbol Maze Learning. Some kinaesthetic learning was added to the 
eh rning of visual patterns. Each child had a printed maze, and the class was 
f own eighteen symbols that could not be crossed on the maze. After seeing the 
orbidden symbols for eight seconds, the maz attempted. Six trials were 
ne test of this type 


involved. 
(9) Group Concept Formation Test. Although only © 

was included in the battery, We feel that there is much to be learned from a 

battery of tests exclusively of the concept-formation type, some being individ- 

ual. Bruner, Piaget, Annett (1959) and others have indicated the importance 

of this aspect of learning, and further studies are clearly needed. This test 

involved the linking of nonsense syllables with pictures. The first example 
y aie: ma EON, andia woman—F 


showed a horse—TIX, a dog— TI? , andake 
a cat, and aske 
(a) all, some being 


The children were then shown 

appropriate syllables after each. There were fou l ; 

repeats, with the complexity of the concepts increasing. Concepts in the middle 

range included things that fly, things that make a sound, and people in uniform ; 

and at the end of the scale—things nocturnal, ph c 

transport. In the older group at Jeast it was clear hildren had benefited 
ts when tackling 


from experience with the early, easy, concepts 
ones. 
Reference Tests’ 


e was 


rteen trials in 


The remaining tests in the battery were as follows : 
oice, vocabulary test. 
(11) A reading comprehension test, similar in form 
(12) Schonell’s Essential Mechanical Arithmetic. 
(13) Schonell’s Essential Problem Arithmetic. 
(14) A twenty-item number series test. 


to the Watts-Vernon test. 
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(15) Digit memory, fourteen items ranging from three to nine digits ; this was- 


given orally and responses were written. 


(16) A spatial test involving the copying of printed figures onto a dotted 
space. 


(17) Goodenough Draw-a-Man test. 


(18) Bender-Gestalt, given as a group test with each figure shown on a large 
card. ; , 2 BO. and 
(19) Overall school attainment had already been examined in 1960, 


z 3 4 EE f 
similar tests were given in the Summer of 1961 to provide a criterion 0 
later achievement, 


IV.—Scorrnc. 

Mollenkopf and Allison seem to have 
their measure of learning. While these ol 
depend very largely also on initial ability. V js 
final-minus-initia] scores, that is the absolute gain. The disadvantage here A 
be linear; thus, there can be n 


for each task, but eventually employed three meas 
y total success, slope or curvature (i.e., a measure a 
i.e., regularity of Progress). Since the latter measure 
yielded few significant correlati i 


1 d yielded bigger differences, but would also be 
likely to be poorer in reliability). However, the amount of progress was deter™- 
ined by correlational analysis, as follows, not by subtracting initial from fina 


Manning and DuBois (1958) argue that the most appropriate measure of 
learning is “ that portion of the measure of final status which is not correlate 
with initial status.” Thus, if we denote initial (or early) performance as 4 ; 

nal performance as Z, and compare these with some criterion, C, or with an) 
other test, then the correlation of effective gain with C will be :* 
Tiz~a) = Ize — TacTaz s 

Manning and DuBois do not indicate how one should correlate gain scores 
on one test with those on 


another. But we can extend the above formula 
to Tests 1 and 2 as follows : 


TZ, -A1) (Z2- A2) = Tayzy + TAIATA, Z TAZ; 
Each of the nine learning tests y 
each 


h of the two groups. With nine reference tests, 
estimates and 196] attainment, thi i i 
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the i ast : 
men (criterion) varjables were generally very similar ; hence they were 
y Correlation of Sums formula. The resulting validities are shown in 


Table 1. 
TABLE 1 


CORR 
ELATIONS SIN Y 

TIONS OF LEARNING TEST GAIN AND TOTAL SCORES AND REFERENCE TESTS WITH 
ATTAINMENT CRITERION. 


Gain Scores | Total Scores 


Learning Tests Reference Tests 
8-9 |10-11 


novage Learning | - : .810 | -598 | Vocabulary ..---- 
Verb ber Substitut'n| - . -655 | 787 | Reading Compr. .- 
Verbal E ous . : 563 :189 Mane Arith. 
W, ormation | * 3 -762 | -688 roblem Arith. 
Word Memory ...-] * ‘276 | -770 | -296 | Number Series 
Ge. ceptual Detail..| « c ‘517 | -198 | Digit Memory 
gome rical Form |- . -317 | -102 | Spatial Test -- 
aymbol Maze: || 4 : :465 | -367 | Goodenough Draw. 
euceRe Formation : -535 | -574 Bender-Gestalt 
1960 Attainment. . 


V.— RESULTS. 

It may be seen that the validities of the Total scores are similar to those of 
the Reference tests, though rather variable. None is as predictive as all-round 
attainment at the beginning of the year. By contrast, the Gain score validities 
are low; yet all but one are positive. Note that there is no consistency here 
between the two age groups, suggesting that the tests with near-zero validities 
o register much progress 


ere those which were too difficult for one group t 
(e.g., Number Substitution for 8-9 years), OT too easy for the other (Perceptual 
se the Standard Errors of correlations 


Detail for the 10-11 years). In any ca a 
based on thirty-six cases are SO large that inferences as to the worth of particular 
tests would be unsafe. For the same reason, neither multiple regression analysis 
nor factor analysis seemed likely to help in elucidating the results. Instead, we 

ula to yield the 


condensed the correlation matrices by Correlation of Sums form 
correlations among groups of similar tests. These are shown In Table 2 (8-9 year 


above, and 10-11 year below, the diagonal). 
TABLE 2 


oNG COMBINED GROUPS OF VARIABLES. 


INTER-CORRELATIONS AM 


Yerbal Gain reme 5 
von- Verbal Gain .. 4 -491 
yebal Total 5 | -595 
Vee Total 4 | 411 
Nirbal Ability 2 | 371 
Stumber Ability 4 | 622 
Tpatial Ability . 3 | -208 
19 50 Attain. ... 1 442 
961 Criterion 2 -512 
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All the gain score correlations with other variables are noticeably lower in 
the 8-9 year than the 10-11 year group, indicating poorer reliability and greater 
specificity among younger children. Even the combined Verbal Gain measures 
(Nos. 1-5) correlate only 0-131 with the combined Non-verbal measures (Nos. 
6-9), as against 0-491 in the older group. In contrast the two sets of Total scores 
show very similar relations to other variables in the two groups (except to the 
non-verbal reference tests; the Goodenough, Bender and Space test had very 
little in common among 10-11 year olds). Thus, overall performance at the new 
tests is quite predictive in the second year of the junior school, but children of 
this age do not show as consistent an ability to make steady progress at un- 
familiar tasks. 


Secondly, it may be noted that Gain scores tend to correlate more highly 
with Totals on the same tests than with any other variables. Children with gocd 
ability in any task do tend to progress more rapidly at this or similar tasks. 
The same point is brought out by the mean correlations of the nine Gain scores 
with : 


8-9 10-11 
Other Gain scores ............ -051 -141 
Own Total scores ............ +375 -472 
Other Tots: sete sb duiecec ae “113 -174 
Reference tests and other variables -111 "215 


It would follow that rate of progress on any one task will not tell the teacher 
or psychologist much about performance on any other task, particularly yika 
younger children. At the same time, Woodrow, Stake and other writers are 20 
justified in regarding learning ability as almost wholly specific. For examp!® 
the mean inter-correlation of 0-141 among 10-11 year olds implies that ou" 
battery of Gain scores would correlate 0-60 with another similar battery ; an¢ 
doubtless better tests could be devised. Moreover, a certain amount of pattern- 
ing among the Gain scores may be observed in Table 2. In both groups the 
Non-Verbal Gains correlate more highly than do the Verbal Gains with the 
spatial-pictorial reference tests. However, the Totals for the learning tests 
show this patterning more consistently. 


Prediction of Attainment. The matrices were now further condensed by 
combining all Gain scores, all Totals, and all Reference tests (but omitting the 
Perceptual Detail Learning and Goodenough in the 10-11 group). Multiple 
correlations with the 1961 criterion were calculated, and the beta weights were 
as follows : 


8-9 10-11 
Gain scores combined ........ —-112 -093 
Total scores combined ........ -546 —-003 
Reference tesis sis wawa ssc eare 231 -198 
1960 Attainment ............ -285 -613 
Mülüple R sa.. uswa -939 -845 

* This might be expected from the derivation of the Gain scores, since the coreo 


of Gain with Total = tzr—rarraz and this expression can be zero or negative oi 
Tar exceeds rzr. In ‘the light of Fleishman's work, it seems unlikely that a per per- 
initial performance could more closely resemble his overall success than does his fina 
formance. 
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Tl i 

eats acces differ markedly for the two groups. In the 10-11’s, the previous 
fothing, but ser is much the best predictor. The learning test Total adds 
very eo . te erence tests cover some more ground, and Gain scores make a 
Mothing to hs itional contibution. In the younger group the Gain scores add 
TAR lease other predictors ; indeed, their beta weight is negative. Previous 
ing Test T nemt is somewhat less diagnostic than at 10-11, but both the Learn- 

st Totals and the Reference tests do contribute appreciably. 


cores on the learning tests would 
form of Gain score but might also 
both the Total score and rate of 


not Sr pee possibility is that the Final s 
bethe 7 be much more convenient than any 
Progress ost predictive type of score. Inso far as 
Sie lane depend to some extent on the rather chancy initial reaction to an 
certain iad situation, it might be argued that the performance reached after a 
scholasti mount of practice at the new task would be a better indicator of 
Cotrelatic, capabilities. This is borne out among the 10-11 year pupils, the 
higher tt n of combined Final scores with the criterion being 0-705, which is 
that it or 0-661 for the combined Total scores. Note, however, from Table 2 
attainme a below the validities of the combined Reference tests and of 1960 
proiiisine i though it could probably be considerably improved if the more 
all give ie earning tests only were used. In the younger group the Final scores 
Too ower correlations than the Total scores with the criterion, though here 
, some of the tests seem promising, €-8- Learning 0-797, Verbal 


nformation 0-719. 


Language 


VI.—CoNcLUSIONS. 
allearning which might be more predictive 
i ' tests has proved some- 
e second junior year, 


ee more successful among older children. Apparently, such tests need to be 
fe: carefully to the level of ability of the particular group of pupils, otherwise 
ee ount of progress made in a limited time 1S highly unreliable and specific. 
eens eae it is not true, as some writers have maintained, a x 
laa y: When Residual Gain is measure ble genera earning 
ara appeared among the older children, and there was $ 
noni factors for learning different types o 
Svare Pal, etc. Each Residual Gain score also co: 
success at that particular task. 
oe predictive value of the various learnin 
ouge sige: the Rete hat 3 battery of success 
i ch more exploration with lan 
vould appear that ave of progress OF gain scores possess 1 rent ae 
saan the final scores achieved after instruction and practice ane ate a 
ive that yield the highest validities, and seem most likely to a o the predic 
value of of conventional tests that are applied without such practice. 

Usefu itati vations can be made during the tests of pupils’ 
lenthods of quale eatin, etc. However, the outlook for individual 
ven ing tests Which could be applied in the educational clinic does not appeat 
Ade. promising, both because these would have to be very lengthy to allow 
iaat practice, and because a child’s reaction to the learning situation in the 

lc setting might differ considerably from his reaction 1n the classroom. 


of Pe hey search for tests involving actu 
what ure scholastic achievement than ) 
at disappointing, particularly with children 1m th 


g tests that were tried out varied 
ful ones could be devised; 
s is needed. Finally, 
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CRITICAL COMMENTS 


T 
EACHING EDUCATIONAL PSYCHOLOGY IN TRAINING COLLEGES* 


T : : 

fie Ra Report ofa Joint Working Party of the British Psychological Society and 

1959 to er ong Rag Teachers in Colleges and Department of Education set up in 

Téecommen anne, teaching of psychology to training college students and to make 
I 


By T. H. B. HOLLINS 
(James Graham College, Leeds.) 
of the B.P.S. and the A.T.C.D.E. has produced 
of an analysis of the results of a questionnaire, 
colleges in January, 1960, and recommenda- 
1] without being damning and the suggestions 


Sere ; they are not too idealistic to be practicable. The report is short and 
gent; it is remarkably free from the hazy, high-sounding generalizations which 
TH : ticularly from committees. f 
colle pounce presented here of the teaching of educational psychology 17 the 
seat at the end of the two-year course, isan unfavourable one. First, the pro- 
Speni n of unqualified lecturers teaching psychology is too big—39 per cent. (I shall 
ane 5 in the present tense, tion has changed markedly 
moive 1960) : and the position is worse tl ndicates because, of course, 
tlle colleges have a numbe: 
the ‘Bes have none at all. I s 1 i 
of report shows, from an analysis of examination papers tio} 
ë all the A.T.Os., that the teaching is unsatisfactory ; | s out-of-date, 
see questions on McDougall’s list of instincts ; most of it does not take account of 
ese work, e.g., there were very few questions on Piaget’s contributions ; . an 
moe were too many questions Which could be answered merely by describing 
3 hool experiences. In short, “ there was little evidence ..- that students were 
xpected to possess specific knowledge in educational psychology as they might be 
expected to possess it in geography Or physics or English.” In view of these con- 
clusions, it is cold comfort to find that “ the allocation of time to the teaching of 
educational psychology i >. it is about 10-15 per cent. of the whole 


college course, and ane a ‘the Education course. 

The picture would be even blacker, I think, if the analysis had gone further. 
A closer examination of the qualifications of specialists 1n educational psychology 
would reveal that too many of them are inadequate. Too many diploma courses 
allow students to qualify by theses describing children’s behaviour on a basis of 
inadequate theory and a lack of rigorous experimental design with appropriate 
statistics, Wiseman} pointed out some of these weaknesses 10 1959; he analysed 
theses and papers in educational psychology and showed how few of them were 
concerned with the fundamental theory of learning and child development, and Bow 
strong was the bias towards descriptive studies © ehaviour. Again, I am sure the 
pen ig of the answers to the 1959 


committee y 

vould have been saddene i s ` 
lace, in my €x: erience, the average 

Ta the AE i estions out of ten, 


pee ie aa instead of the questions. | 7 
ent, answering a paper in W h he has to choo rat Of 
cose, €-g~» On 


avoids an f E A as 
any question 1m which he may d chi 

z rer to the four he does , f 
e the same At ye ; on learning—the essentials 


ee he tends to giv j % 

in Ky N eGil i i 

ate ting maladjustment ay: in ne: Y fe rubric were to forbid the mention © 

Fey * few would be able to answer at all, at least, amongst students training or 

N e bit f the B.P.S 

* Copi available from the 0 ces of the B.P.S. 

me m ae aie, re Cla or the A.T ‘C.D-E., 151, Gower Stree 

st free). r j à 

2 t als S. (1959) Trends in educational psychology- Brit. J. Educ. Psychol., 
9, 128-135. , S. 5 
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Some excuse can be found for the shortcomings of the teaching if one considers 
such factors as the small amount of time available in the two-year course, the 
demands made on lecturers in most colleges, which prevented adequate study, the 
poor intellectual calibre of some of the students (and this report remarks on our 
tendency to spend a disproportionate amount of our effort on the near-failures), and 
the gaps and weaknesses in psychological knowledge. A worthwhile psychology text 
is difficult (particularly to women students, whose experience of mathematics has 
frightened them off any information presented in tables) and if students are getting 
nothing from their reading (or, worse still, learning off the text by rote) we are 
forced to increase the practical work (in itself, an essential approach to psychology 
and children) to the extent of leaving out most of the theory. Brearley,* in 1957, 
gives an outline of a practical course for ‘ primary ’ students and she concludes, 
realistically, no doubt, ‘ Comparatively few students are ready to understand fully 
the tenets of the major schools of psychology, but we have greatly failed if at 
least some of them are not ready to study them when they leave.” Collier,t in 
1960, working with men students, reports ‘' A difficulty is getting students to rea 
effectively ...; it is the quality of reading that troubles us” and finds that theory 
and practice only come together in ‘ Study-Practice’; he makes a good case for 
a variety of methods, mainly practical, in this article, and some of his ideas are 
incorporated in a valuable section on methods in this report. As for psychology 
itself, the report rightly warns us to make“ a cautious use of psychological 
theory ” ; but this is putting it mildly. We have some certainties in the field of 
abilities, but in the field of learning, and personality structure and development, 
which are central to the teaching situation, we are among the “ Two-and-Seventy 
jarring Sects. ” In addition, nearly all the terms we wish to use in educational 
psychology have built-in value judgments, which lower their scientific value ; the 
report regrets that so few questions are set on ‘ motivation,’ but maybe the 
lecturers, after reading a few pages of Peters’ Concept of Motivation, have shown @ 
wise discretion. Of course, the study of psychology can sometimes be as good an 
education as the study of literature ; both will show the complexity of life and the 
unlikelihood of finding easy solutions to problems ; both might produce a desire to 
solve the problems, nevertheless ; but few student-teachers are likely to reach this 
level of sophistication. Some lecturers, too, become tired of keeping up with what 
seem merely fashions in psychology. 

However, signs of discouragement are, in fact, seldom observed. The compara- 
tive neglect of many areas of educational psychology in the training colleges, up t° 
1960, when the three-year course came in, was due, not to pessimism about the value 
of psychology, but to an optimistic concentration on an educational policy which 
took up such psychological contributions as reinforced the policy and neglected the 
rest. This policy was, and is, to make the schools child-centred. A training college 
educationist, especially in primary work, would see the task as persuading the 18-yeat 
old student to change from the concept of learning she would have gained in het 
recent education, i.e., close study of a subject, to a concept of individual children 
learning in a controlled but permissive environment where social and emotion 
learning are as important as intellectual. 


In this atmosphere, gentle but insistent pressure is put on both student ae 
child to adjust to the norm of the ‘ happy individual ’ which seems to be the idea ete 
the clinical psychologist. McDougall’s theories seem to give psychological resp sa 
ability to this aim—all the instincts and emotions working together, in the € 4 i 
under the happy jurisdiction of the master-sentiment. Freud will not do; seX SEA 
aggression seem always about to overcome reason and civilization ; or, if repress 
is complete, neurosis seems inevitable ; and there is a distasteful pessimism hing 
dictum ' The function of education is to inhibit, suppress and forbid.’ I am laug 

* BREARLEY, M. (1957). The students’ understanding of children. Educ. for Teach 
No. 42, 13-18. Brit J: 

+ CoLLIER, K. G. (1960). The teaching of psychology in training colleges- aS 
Educ. Psychol., 30, 103-108. 
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only at the e 
lindor xcesses of the movement and deplorin i 
in : g only what 
Hag helped E E of course, one must have only ee ae sade 
up fully with pa hep our fine infant schools and which, even ifit has AoE ak, a h 
psychological ee ous pumami aF practical understanding of EE 
through ul a individua ifferences, reading-readine i 
Galoredtine books experience T and which e a flow o ae aad 
hem, or teaching reading and number, with efficient methods for using 
What i 
Olilook pall dager of course, is to retain the benefits of this crusading educational 
add to it the benefits which come from the more dispassionate 


scientifi 4 
Itis hare te a which takes a cool look at the evidence and believes in experiment. 
at a deeper, more searching, study of psychology in the training colleges, 
res, would be of 


led b 4 

Valiie Ty Eeychoe properly trained in experimental procedu: 

Reeser a conditions for this to come about are more favourable than in 1960. 
uction of the three-year course will give students more time to mature and 


to i 
gain a better understanding of psycho 


etermine r f $ pa 
mined effort, in the institutes and colleges, to ensure 
The institutes have, for many years now, 


eciable proportion of students 
hat, owing to the inability of 


ti i á 
a highly intelligent students, although the range of ability 
one. It is against this backgroun' ations of the report 
d practicable. All of them are important, but the 
d development 


hich would contribute most to a rapi 
wth of a scientific outlook amongst 


the staff should have a degree in 
's, Master's Or Ph.D. degree in 


ast one member of 
rse is in the field of educational 


lent in a Bachelor 
ain content of the cou 


co-operative task in most 
bject should work 


psychology. 
ducational psychology, being a 
knowledge of the 


(2)... the teaching in e 
colleges, requires that me rned with the su 
as a team and the fullest use should be made of the specialist 
lecturer best qualified in educational psychology. 
i ist for obtaining further 


(3) More use should be made of the facilities which exis 
(The facilities are sum- 


qualifications through full-time @ 

(4) ane in Appendix 6.) 
Ve press strongly for the inclusion of more advan 

Tional psychology for the abler students. i 
avail tis particularly desirable ed courses 1n psychology should be made 
of Aes to the more gifted students. Although the chief reasons for this are mainly 
trai e idealistic kind, already i it should not be overlooked that the 
eg colleges are hoping to have their certificate recognized in some way by the 
cou. ersities and even to have their better stud ig on. degree-standard 
Bets in Education. This would link up with the growing demand to have 
ERE and particularly educational psychology, 35 a first-degree subject in 
sities. 

ob: Of course, no startling ad n be looked for immediately. The chief 
stacles to the development © as a science lie outside the ranks of 
Some of these are—the indifference, 


S k 

Sey chologists and ‘eaining-college lecturers. 

Psy ven contempt, of universities, PU i Is and grammar schools towards 

chology, which means it will continu per share of the 
earch, for some years yet; the large classes 


top- 
P-level of ability, and of funds for res 
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in schools, which force the teacher to be unduly concerned with class management 
and keep him suspicious of a science which thrives on the variations amongst 
individuals; and the unenlightened teaching of mathematics and the physical 
sciences in schools, particularly to girls, which results in the inability of most 
students to study psychology beyond the anecdotal stage. Nevertheless, this report 
makes useful suggestions for another step forward. 


II 


By J. M. THYNE 
(Jordanhill College of Education, Glasgow.) 


The Scottish colleges of education were not involved in the ATCDE/BPS 
inquiry, but the Report is, of course, of professional interest to them, and to their 
psychology departments in particular. I comment, however, as the representative 
of only one, albeit the largest, of them—with a present full-time student population 
of over 2,500. 

With the first part of the Report, most of my colleagues in the other colleges 
would be, I think, in general agreement. Child development (social, emotional, 
physical, intellectual, moral) ; the nature and conditions of learning as they relate 
to habits, skills, remembering, understanding, and transfer ; the psychology of the 
basic school subjects ; intelligence and its measurement, and applications to 
educational issues ; these have long been major parts of the psychology syllabus for 
women on the three-year course. (Our departments of psychology are concerned 
also with graduate students, with students holding diplomas in art, music, domestic 
science and other technical subjects ; but women on the three-year course form the 
largest group of students—over 1,600 at Jordanhill, this year—and it is to them that 
my comments here refer.) 

What to include in the psychology course is, nevertheless, a major problem, 
not because it is difficult to find material relevant to the training of teachers, for it 
is not, but because of the difficulty of deciding what must be left out. The relevant 
material has to be arranged in a hierarchy of relevance, and a line has to be draw? 
in accord with the time available, the claims of other subjects, the abilities of 
the students, their own psychological maturity, and other things besides. In 
contrast with the Report, the line I draw excludes the training of students to apply 
tests, but it does include an attempt to cultivate in the student an understanding of 
the history, structure, and function of tests, and an ability to interpret in education@ 
terms the results of tests which will be applied to their pupils by persons with greater 
expertise. This example raises the question of the criteria to be applied in selecting 
psychological topics, a point raised again in my concluding paragraph. 

Tam in sympathy with the caution expressed in the same section of the Report, 
though I have some reservations. As the Report says, argument by analogy mor 
animal learning, and from highly controlled experiments on human learning, iS © 
course, dangerous. But I, and many of my colleagues, would regard a deliberato 
attempt to minimise this danger as one of the fundamental tasks of the psy cholog15 
who undertakes to teach his subject to teachers. It is his special responsibility 
(not necessarily shared by his fellow-psychologists in universities) to ensure ha 
what he says about learning (for example) is, in fact, valid for the kind of situatio 
which will confront his students. And he will try to do this, not by ignoring th 
material in question, but by restructuring it, and testing in the school situation ing 
results of his restructuring it. For example, analysis of human habit-form ie 
indicates that its conditions of occurrence (if not its content) are basically the, Bes 
as those made explicit in classical and instrumental conditioning. Also, it ions 
difficult to show that certain psycho-analytic mechanisms, such as Projec E 
occur within the lecture-room and the classroom. Caution is not enough ; it Tao 
lead to constructive thinking. Admittedly, the touchstone of practicality ene 
only to give reality to what is taught but also to reveal the sterility of what one m 
otherwise have taught. It is also a humiliating revealer of one’s own ignorance. 
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re 
member? Is the material up-to- 


f Critical Comments IQI 
I ami i 

Hmeuieh bets bias me agreement with the Report’s emphasis on the need to dis- 
en educational psychology and educational philosophy, but I have 
distinction clear to all 


learn eras 
ed to be pessimistic about the possibility of making that 
ve a department of psychology and a 


stud 
Se ane all the Scottish colleges hav 
this se pons of education (an arrangement of which I heartily approve), but 
what the ay of segregation does not always make apparent to students 
education ce rence in function is. To take but one rather blunt example: in many 
EA minars students are encouraged (very properly, I believe) to express 
Seminar A Spam and develop their own judgements ; whereas in a psychology 
knowled y may have to learn that the opinion of a student, as contrasted with his 
ge, is of little or no value. What matters here are the k 
o learn also that the 


t! s 

ERT] by reference to them. 

lsrandation oi two or three children does not provide an adequate basis for the 

can, of of general hypotheses about child development. (Such observation 
, of course, be very useful for other purposes.) 

nt and forms of psychology 


On the central part of the Report (the actual conte! 
ven when accompanied by tables 


Ctirees| itia di 

ores) F is difficult to comment. Lists of topics, € 

in detail a o not answer the questions that really matter. What topics are expounded 
just what na not merely mentioned in passin in a somewhat general way? And 
referred letails are emphasised the syllabus, or iS 

to in lectures, seminars, and discussions, is cted to know an 

ate? Even the analysis and classification of 
le task) reveals remarkably 


e Fate 

ie questions (which must have been 

Ap about the above issues, though some of the general questions quoted in 

Me aaa 4 seem to imply that some parts of the courses mM 
ained little detail. I am tempted to suggest that more precise in 


h Py 
ave been obtained from an analysis not only of the examination ques 
hologists could scarcely object to a procedure 


o 
a students’ answers. Good psyc: 
y used within their own discipline. 
iny = robably the major difference between the Scottish colleges and the colleges 
ANI ved in the Report relates to tl i f the teachers of psychology: 
de ecturers in psychology in the Scottish coll ducation have an honours 
ra of which psychology isa main subject. ned teachers, with experi- 
I ce of teaching. Some have worked in child guidance clinics. In such a context 
aon bound to comment that the recommendations of the Report in this connection 
e unduly mild, though I do appreciate many of t 
The problem of suitable textbooks is serious, 


many of the te i y have lit 
xts which are sound on ps chology have 4M" 
van of oas which are realistic about the classroom 
ys would use tha’ 


sychologists nowada; i 
k of educational 


that the ‘ enera 
: p When reasonably compact 


word. My own experience suggest tt 
m training colleges- ) 
ing, Child Development, Intelligence) 


ey may have to disp : Sema 
xts on speci i h as Memory, Lear ing, 0 
are ae gees Hf than the thinly spre l 
volume, The“ specie tion but also of special- 
lisation. 

I have not attempted to ass comments 0n all parts of the. ort, for such an 
attempt would ae more ace han it would deserve: think that the typa o. 
investigation described i Report is valuable, and the on respor chat 
Ra it merits our thanks. Now: without in any way : a Pe in a neo T 
geport, may I suggest that a further tin might profitably °° underta n, y 

© discover the major problems actually faced D: lecturers 10 psycho ogy, an se 
Ways in which they have been tackled ? T example, what criteria are cited 1n 
Selecting psychological material for a course ? And why ? Is there an optimum oe 
Of presentation? How do we solve the problem of breadth versus depth ? But i 
Would be part of soch an investigation to discover what questions should be asked. 


yof relevant selec 


COMMENT 
ON 


THE RELATION OF NEUROTICISM AND EXTRAVERSION TO 
INTELLIGENCE AND EDUCATIONAL ATTAINMENT 


A brief comment on the controversy between J. B. Biggs and R. Lynn. 
By H. J. EYSENCK 


In the controversy between Lynn and Biggs, the crucial part appears to be 
played by the exact status of ‘ anxiety.’ Biggs assumes that ‘anxiety’ and 
“neuroticism ’ are used interchangeably by me, but this is incorrect. Neuroticism 
is regarded as an inherited psycho-physical disposition, closely linked with the 
lability of the autonomic system, which governs a person’s emotional reactivity, 
and may predispose him to the development of neurotic disorders under suitable 
circumstances. Anxiety is a conditioned fear reaction which is particularly character- 
istic of dysthymic neurotics, i.e., of persons who are high on the factor of neuroticism 
and also on the factor of introversion, which is significantly correlated with con- 
ditionability. Anxiety, therefore, is a ‘ mixed’ concept, being related both to 
neuroticism and to introversion. (The position is rendered rather confused by 


Cattell’s use of the terms ‘ neuroticism ’ and ' anxiety,’ which is exactly the opposite 
of mine.) 


My theory, which links introversion and conditionability, is not necessarily 
antagonistic to that of Spence, which lin! 
conditionability. On the theoretical level either theory, or both, or neither, might be 
right. My belief that Spence’s theory is inadequate is based on the experimental 
evidence, and may very well be mistaken ; the experimentally demonstrated 
correlation between the Manifest Anxiety Scale and tests of introversion make al 


the Spence-Taylor experiments using the M.A.S. equivocal. In our own work we 
have never i ici i 


ability, and neurotics as a 


I agree with Biggs that a great deal more work is needed at the theoretical 
level, but I feel even more strongly that what is needed is a greater concentration 
on experimental studies, with particular reference to parametric investigations. The 
frequently contradictory results obtained by investigators testing both Spence’s 
hypothesis and my own suggest that neither hypothesis is universally valid, but 
requires to be closely circumscribed with respect to its validity by detailed specifica: 
tion by such parameters as strength of CS and UCS, CS-UCS interval, proportion © 
reinforced trials and so on. Lynn’s study appears to me to bea logical deduction 
from principles which appear to have at least some experimental support, although 
this support may be rather circumscribed in the manner just outlined, and although 
it may, of course, be possible to account for his results in other terms than thos 


premised in his theory. This hardly detracts from the value of his important 
demonstration. 
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CRITICAL NOTICES 


Hunt, 
NT, J. McV i 
J. McV. (1961). Intelligence and Experience. New York: Ronald Press, 


pp. vii+416, no price given. 
By P. E. VERNON. 


This i i 
s an important book on at least two counts. First, Professor Hunt puts 


forward t 

inborn As stronge case yet made for discarding the conceptio: 

ofenvicoume which is predetermined by the genes and which ma 
ntal conditions. Secondly, he provides the most complete exposition 


available in Engli 
e in English of the whole corpus of Piaget's theories and experiments on 
him to conclude that intelligence should be 


men 
onal nee lori which lead 
develop age central processes comprising strategies for processing information that 
e he course 0 ith his environment.” 
enerali arpaa that predeterminism hi tal psychology 
Teaming and. example in the conv veen maturation an 
children’s i f at the advice 81” mental hygienists not to‘ push” their 
common px ellectual growth. its i i 
intellie action to experimental demonstrati 
ETTI (e.g., those O i 
Se am controversy. 
Pia. Pong central pro 
TA s descriptions of 
kae ‘ope developmen 
changed sets, and many oth i evidence, the w 
age Besides describing 
‘ary evidence, such as Ges 


timulation, 
d assimilation in the 


tic 
on, and shows how these can 
can mae does not deny that genetic factors set a limit to the co; xity of} 
e learned, but his interpretation of the work of modern neticists leads him to 
netic co t either in physica 
nvironmental 


d 
Satie we can ever isolate a f 
varlanc tal development. Atter ts to apportion 
best ee (e.g., the often quoted 80 per cent.—20 P' 
bein: hey only reflect the em 

g studied, and tell us nothi ut wha 

* tigations of more extreme env! 
redict that the application ofa 


m 
DAA conditions. Many investi 
aes far greater effects, and he goes. so far s to t t 
psychology of development could raise the average intelligence bY an amoun! 
I he review er’ opinion, this forecast is un- 
s that inhibit than those that 


Sy aes to 30 points Of -x rs 
Facey rash ; we know far more about the con ition: ; 1 
ace the full realization of intellectual potential. nd, although Piaget's work has 
ou own a flood of light on cognitive processes i does little or 7° 

r control of the even more important motivational ones. 
tot The main key to such im ims, i og of environment 
re ue current level of the child’s schemata ; t 4 
e alized by the good school teacher): Jf the child is set problems which merely 
gasra schemata that are anized, he gets bored ; while, if they are too 
T ahead of him, he is unin ereas with appropriate choice 
a stimuli, there should positive pleasure in i y the baby's 
Beason of talking an i nces which are appro) 
nie may inhibit at another ; ‘or examp!©: her may 
F pence adequate experience in the imotor stage, ye deprive her | 
= © preconceptual (144 year) stage. Variat another import- 
nt principle in the develo ent of flexible and transferal 
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These educational implications are among the most original parts of the hoolt 
For the most part it covers ground which is already fairly familiar to sorea ona 
psychologists in Britain (though perhaps less widely known in America). Asis ony 
to be expected from a polemical treatise, the author’s enthusiasm sometimes a 
away with him. For example, it would have been helpful if he had given Te 
consideration to the strong counter arguments that have been put forward by 


adolescence). The more carefully controlled studies of Peel and Lovell, for example, 
indicate much more variation and inconsisten i 


, although fairly technical an parts, it is a good deal easier 
to follow than Hebb’s or Piaget’s own writings. 


advanced students and to lecturers in education 


THE APPRECIATION OF BEAUTY 


VALENTINE, C. W. (1962). The Exp 


erimental Psychology of Beauty. London : 
Methuen, pp. xiv+438, 45s. net. 


By CYRIL BURT 


Professor Valentine’s aim has been to present 
ments on the appreciation of art, music, and poetry 
Psychological differences between individuals he 
ences in what is known as ‘ artistic taste.’ 
these various researches have frequently been children of 
apart from the intrinsic interest of the wi 


that it denotes “ something that is worthy to exist,” 
what we Prefer, like, or admire.” But once again, th 


e 
alentine tells us in his final chapter) is that oe 
i “delighted absorption in the aes 
easingly clear. ‘‘ Art,” as Herbert Read too 
beauty: that cannot be said too often oF 


— 
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blatantly.” es 

REY. Nor is the: i i 

emotion T asthi re any unique type of feeling that we can label th i 
The experience seems rather to be a perception or pas gerie 


of some parti Š 

by“a seo quiy or combination of qualities, which isas a rule accompanied 

complexity of ee alend of feelings and a certain attitude of mind.” The unexpected 

are perhaps tt he w ole process and the astonishing number of factors that affect it 
he most instructive conclusions of the entire series of investigations 


th 
at Se Valentine describes. 
the Pod T divided into three parts—the first dealing w: 
simplest t with auditory, and the third with verbal. In each he begins with the 
Accordin res of stimuli, and advances progressively to the more complicated. 
gly, after a brief introduction, his first chapter starts with experiments on 


sim 
oth R le colus; and reports the results of numerous studies carried out by himself and 
raceesaind cease eferences among infants, school children, and adults from various 
during carl mere So far as comparable data can be obtained, it appears that 
red, an 2 childhood vivid colours are everywhere strongly attractive, and that 
Breen or Ee and yellow are generally preferred to green, blue, or violet. Later, 
ue tend to predominate, and white rises high in order of preference. 
ividual differences, and detect the 


Here at 

Silk hap ged outset we encounter marked indi 

more aa a variety of factors, similar to those we meet with in experiments on 
as they p ex objects. Thus, in one set of investigations it was found that children 
partion es] ow older become more and more influenced by personal association, 
Saturat i y in the case of colours that are unsaturated or ‘mixed’ rather than 
‘ ated and ‘ pure’: one boy, ss he likes a pale faded yellow-green, 


ith visual experiences, 


for example, Say 
if femi n f leavesin Autumn ”; another dislikes red, “ because 
inds him of blood.” These strate the commonest or “ associative type.’ 
“active ’ feelings the sight of colours arouse: “ this 

u feel hot all over,” “ the green 
alled ‘ character type’: 
i another of pink 
irely ‘ objec- 


yellow dazzles the eye,” “ that brig 
Somewhat rarer is the so © 
“ Joud and aggressive rae 


F girl, for instance, speaks of scarlet as 
tive rather demure.” And finally, a few among the oldest adopt an ent e 
attitude, and give reasons relating solely to the colours themselves : they like 
” and dislike a certain green “ because 


a A s 
an ultramarine ‘‘ because it is such a pure blue, l 
be a really good green. ” As Professor Valentine points 

very medium and a 


oes much yellow init to be £ £ 1 
aita ese four types, with varying frequencies, crop up in ever) t I 
su st every stage. Nevertheless, SO he maintains, fficient uniformity to 
in Seest, in addition to the effects of experience Or temperament, the influence of an 
jaa Ae differential response to colours of different hues, © 
ad he child matures. F lours we turn to col é 
canoe once we find that the m of another hue can greatly modify the 
ae ial preferences. However, the appreciation of colour-harmo: the distaste 
z colour-clash does not appear un: n years, according to 
ost observers, though with a few indi 
Ser rom colour we proceed to: 2 
inf preferences for certain kinds of lines, curves 
uence of Einfihlung or‘ empar? 
n the place 


the spectator, so to 
d shapes ; he feels 
horizontal oblong. 


Speak, s ‘ : š 
; , ympatheticall uts himself i 

syne moving along 7E graceful curve Or supporting the heavy f 

Subjective types ' and ‘ character types’ are thus strongly 19, evidence. ; Certain 

as ‘ lively vor“ layful,’, others as «harsh or cruel.’ Rising 

“weary OT“ sad.’ And the experi- 


zi 5 
ceas are described l 
m seem ‘ active ’ OT * hopeful ’; falling lines r E L 

ents sufficiently prove, if proof were neaded, that there 1° no single line or curve 


Which is unanimously preferred. The recurrent search D 
3 for a so-called ‘line of beauty’ ended, w 


jdogarth, and Winckelmann s 

e serpentine curve question-mark 

From the canons of Polyclitus and ward to the present day, 

numerous writers hav i at, as in music and poetry, So in art an 

architecture, the secret of be: lies in the com ponent proportions, and that these 

Proportions could be numerically expressed. The doctrine is most succinctly summed 
P in a dictum of Palladio 7 Quattro Libri dell’ A rchitettura, 1570) : “ the beauty of 


Vitruvius of 
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threshold curve, therefore, the match 
“diameter” was given the same inter- 
mediate value (13.5’) as for the supra- 
threshold curves, and “luminance” 
was assumed equal to the test- 
threshold luminance for that particu- 
lar diameter. The result of this, of 
course, is that the left side of Equa- 
tion 36 remains constant, which con- 
dition reflects the fact that the match 
field is at a constant “effect” level 
(of threshold). 

As can be seen in Fig. 20, the theory 

predicts the data in a satisfactory 
manner. At the low test-threshold 
luminance values, enhancement takes 
place. This is to be expected from 
Equation 36, since at low B, values 
the spontaneous discharge term which 
has B; in its denominator (see Equa- 
tions 20, 36 and 41), is relatively large; 
variation of Ca in Equation 36, 
therefore, should be effective. At 
high B, values, however, the spon- 
taneous discharge term is already too 
small for variation over the same 
Ct,a range to make much difference. 
Or, in speculative words, high test 
luminance inhibits spontaneous ac- 
tivity by direct and/or scattered 
light so that as test area increases 
there is little or no change in spon- 
taneous activity. Low test lumi- 
nances, however, allow spontaneous 
activity to be effective. At low test 
luminance therefore, spontaneous dis- 
charge inhibition is present and thus 
reduced by an expanding test area 
through off inhibition. Test bright- 
ness, in this way increases, or thresh- 
old luminance decreases. Such pre- 
dictions are borne out in Fig. 20. 

It should be noted that variation of 
internal test inhibition, Cz. is not 
very effective since its exponent c3 is 
equal to .3010, according to Equation 
39. The physiological explanation 
for this lack of effect is not obvious 


at this time. We might point, how- 


ever, to the data of Boynton (1951) 
to support the reliability of this find- 
ing. He measured the height of the b 
wave in electroretinographic record- 
ings as a function of the area of small 
test fields. He found that as area 
increased, the b wave response also in- 
creased. The interesting point, how- 
ever, was that he could account for 
the charge in b wave height solely on 
the basis of the change in the amount 
of scattered light hiding the retinal 
surround of the tes “eld. Presum- 
ably, changes of ~~~ of light 
striking within the u ers neither 
reduced nor increased the b wave 
response. Which is to say in our own 
theoretical terms, that variation of 
Ci, was not effective. 


Discussion AND CONCLUSIONS 


Our theory set out to explain psy- 
chophysical depression and enhance- 
ment in terms of physiological inhibi- 
tion occurring between on fibers 
(depression) and between on and off 
fibers (enhancement). How success- 
ful has it been? 


Negative Aspects 


We find that although the changes 
that occur (in the brightness response) 
in all the foregoing experiments have 
been generally predicted, it can be 
seen in Table 1 that exact prediction 
of the results in the experiments of 
Heinemann (1955) and Fry and Al- 
pern (1953) require values of con- 
stants K; and K; quite a bit different 
than those for the other experiments. 
It is doubtful that these changes in 
K; and K; can be attributed to un- 
controlled variables (e.g., personality) 
that might be operating differently in 
the Heinemann and Fry and Alpern 
experiments. It seems more likely 
that our theory is lacking in its de- 
velopment of the term S; in Formula 
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37. That is, that amount by which 
the inducing field surrounds the test 
field is substantially greater in the 
Heinemann and Fry and Alpern ex- 
periments than in the others. Before 
this term is developed in detail, 
however, it would probably be more 
appropriate to first know empirically 
what happens to test brightness as 
S; is varied. 

Another consideration is that our 
b and ¢ equations (37 and 38) are not 
rational. Nor are the O formulae 
(40, 41, 42) rational, even though we 
can point to their probable physio- 
logical bases. The improvement of 
this situation also awaits further ex- 
perimentation and knowledge. 


Positive Aspects 


On the positive side of the picture, 
our theory has integrated in an explicit 
fashion all the major changes that 
occurred in the physiological experi- 
ments and psychophysical multiple- 
and single-field experiments that were 
considered. And except for changes 
in Ks and K, the empirically de- 
termined constants remain relatively 
stable, except when, because of a par- 
ticular experimental procedure or lack 
of control (as in Beitel, 1936) a change 
in constants is predicted. 


Predictions 

An interesting prediction is sug- 
gested by the foregoing theory. i This 
has to do with the relationship be- 
tween enhancement and spontaneous 
discharge and the possible dependence 
of spontaneous discharge upon reticu- 
lar activation. According to our 
theory, changes in reticular activation 
could change the value of the term D 
in Equation 35 and, therefore, change 
the amount of enhancement present 
in either multiple- or single-field 
situations. Preliminary experiments 
indicate that this might be happening. 


An initial experiment by Diamond 
is investigating the correlation be- 
tween the amount of single-field en- 
hancement (as found in Diamond’s 
single-field area experiment above) 
and the amount of reticular activa- 
tion, as measured by blood pressure— 
Lindsley (1951) suggests this as a 
possible measure. Our theory would 
predict that the higher the reticular 
activation as measured by blood pres- 
sure, the greater the amount of single- 
fieldenhancement. The results, taken 
from a sample of 30 college students, 
so far show a positive and signifi- 
cant (at the .01 level of confidence) 
Spearman rank order correlation be- 
tween enhancement and blood pres- 
sure as predicted. The correlation is 
low but it still suggests support of our 
theory; i.e., factors other than reticu- 
lar activity might be causing blood 
pressure differences from S to S, thus 
reducing the correlation. This ex- 
periment has yet to be performed for 
the multiple-field situation. 

A second investigation by Kleman, 
Diamond, and Smith is studying the 
relationship between enhancement and 
the administration of a drug that 
might affect reticularactivation. The 
drug that is being used is caffeine; it 
is a stimulant and also safe to use with 
college Ss. Heinemann’s (1955) mul- 
tiple-field situation is employed (see 
Fig. 11) in which enhancement ap- 
pears for the low inducing- and test- 
field luminances (see Fig. 12). Our 
initial findings, in Fig. 21, show that 
caffeine administration causes en- 
hancement to disappear. 

Alternative theoretical explanations 
seem possible at this point. A first 
hypothesis is that caffeine stimulates 
the excitatory portion of the reticular 
system; this stimulation is relayed to 
the retina, where, as a result, on fiber 
activity is increased more than is off 
fiber activity. Granit (1955, pp. 106— 


198 
107) has shown that on as well as off 
fiber activity increases with reticular 
stimulation. A relatively strong in- 
crease in on activity would weaken 
off activity and reduce enhancement. 

A prediction accompanying _ this 
hypothesis is that caffeine administra- 
tion should lower absolute threshold 
for a single field (since on fiber ac- 
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Fic. 21. The effect of caffeine on the test- 


brightness versus inducing-luminance func- 
tion. Note that enhancement disappears 
with caffeine administration. (Kleman, Dia- 
mond, & Smith, unpublished.) 
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tivity is supposed to increase). We 
find in a preliminary experiment that 
this does happen for some Ss but not 
for others. Why these individual 
differences exist in this respect is now 
being explored. An alternative hy- 
pothesis is that caffeine acts directly 
on the on fibers in the retina to re- 
duce enhancement in the above man- 


ner. A choice of the best of these two 
hypotheses awaits further experi- 
mentation. 
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A TWO-FACTOR THEORY OF INHIBITION * 


MERRELL E. THOMPSON 
New Mexico State University 


Perhaps no other aspect of Hull’s 
(1943) learning theory has been so 
thoroughly explored, both empirically 
and theoretically, during recent years 
as his theoretical treatment of ex- 
tinction and related phenomena. No 
fewer than 12 theoretical papers 
dealing with this problem have ap- 
peared in the Psychological Review 
during the past seven years (Cotton, 
1955; Ellis, 1953; Estes, 1955; Glan- 
zer, 1953a; Gleitman, Nachmias, & 
Neisser, 1954; Johdai, 1956; Maatsch, 
1954; Razran, 1956; Saltz, 1953; 
Walker, 1958; Wasserman, 1952; 
Wolpe, 1952). 

Gleitman, et al. (1954) presented 
a critical and important evaluation of 
Mull’s formulation in which they 
pointed out several paradoxical con- 
sequences which can be derived from 
Hull’s postulates, one being that the 
performance curve should rise to a 
maximum and with continued rein- 
forced trials eventually decline to its 
starting point; another being that re- 
conditioning to the original level would 
be impossible. Most of the criticisms 
of these authors were concerned with 
conditioned inhibition, a concept which 
many psychologists have held in low 
esteem for years.’ 

The most serious attempt at formu- 
lation of an alternative theory has been 


made by Glanzer (1953a), who as- 

1 This work has been supported in Part by 
Grant G-5817 from the National Science 
Foundation. An abbreviated version of the 
Present paper was given at the 1958 meet. 
ing of the Rocky Mountain Psychological 
Association. Appreciation is expressed to 
Jean P. Thompson for valuable assistance 
in preparing this paper. Thanks are also 
due Joseph H. Forsyth for critically reading 
the manuscript. 
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sumed that with continued exposure to 
the same stimulus the organism be- 
comes less active, i.e., stimulus sati- 
ated. Glanzer makes the additional as- 
sumption that this “boredom-like” ef- 
fect dissipates with time. From this 
theory are deduced spontaneous alter- 
nation and related phenomena. 

A critical evaluation of Glanzer’s 
formulation, together with numerous 
empirical studies, illustrates the limita- 
tion of the stimulus satiation concept. 
Glanzer, like Hull, can explain certain 
empirically known facts, but finds it 
difficult to handle others, For in- 
Stance, it is difficult to see how Glan- 
zer’s formulation can explain such a 
phenomenon as an increased propor- 
tion of left turns following a forced 
right turn and an increase in this ef- 
fect following two forced right turns 
(Grosslight & Ticknor, 1953; 
son, 1952). Glanzer (1953 
failed to verify Deduction 5 w 
pears to be basic to his theory. De- 
duction 5 „States: “In the simple 
two-alternative situation, as stimuli dif- 
ferentiating the alternatives are elim- 


inated, spontaneous alternation will de- 
crease” (p, 262) 


Thomp- 


a) also 
hich ap- 


e upon the develop- 
dissipation of inhibition. 
results of the reported 
r to be complicated by 
in such variables as the 
nature of the first trial in a block of 
two consecutive trials (free or forced) 
and reward for both alternatives or no 
reward in’ the maze. Both of these 
authors failed to differentiate between 


ment and 
The diverse 
Studies appea 
differences 
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reinforced and nonreinforced responses 
on inhibitory potential. The majority 
of studies cited in support of the stim- 
ulus satiation theory appear to employ 
nonreinforced responses. If this vari- 
able should lead to results which differ 
from studies using reinforced re- 
sponses, we might find that both Hull’s 
reactive inhibition postulate and Glan- 
zer’s stimulus satiation postulate have 
merit. ` 

In view of the conflict between em- 
pirical data and the deductions from 
existing theories of inhibitory poten- 
tial, a need is indicated for a more 
comprehensive theory than the one 
presented by Glanzer, one which also 
avoids the paradoxes derived from 
Hull’s formulation. 

The two-factor theory of inhibitory 
potential presented in this paper was 
first designed in an attempt to explain 
some of the apparently conflicting re- 
sults of studies on spontaneous alterna- 
tion. The theory is a modification of 
Hull’s reactive inhibition postulate and 
Glanzer’s stimulus satiation postulate. 
This revised formulation should not 
only enable us to take care of some of 
the contradictions in the theories pre- 
sented by Hull and Glanzer, but yield 
novel predictions, and help to explain 
empirical results from a number of 
apparently unrelated areas. 

Following the procedure of Hull and 
Glanzer, the theory is presented in the 
form of three postulates, as follows: 

Postulate 1. Whenever an organism 
makes a response,” there is developed 
an increment of inhibition which de- 
creases the probability of recurrence 
of that same response. This condition 


2The term response as used here refers 
to acts producing changes in the immediate 
environment or in the relation of the or- 
ganism to the immediate environment. Ex- 
amples of response as defined above would 
be turning right in a maze, approaching a 
food cup, pressing a bar, running down a 
maze alley, etc. 


is called response inhibition (Ip). The 
amount of Ip developed by a series of 
response evocations is a negative 
growth function of the number of evo- 
cations, and dissipates as a simple 
negative growth function of time. 

Postulate 2, Whenever an organism 
makes a response, or is exposed to 
a stimulus in the absence of primary 
reinforcement, there is developed an 
increment of inhibition which decreases 
the probability that the organism will 
return to or respond to the same stim- 
ulus, providing the organism has pre- 
viously been rewarded in the presence 
of similar stimuli (e.g., in previous 
maze training or during preliminary 
training). This condition is called 
stimulus inhibition (Zs). The amount 
of Is developed is a negative growth 
function of number of stimulus ex- 
posures. Stimulus inhibition dissi- 
pates very slowly with the passage of 
time and will generalize from one stim- 
ulus to other similar stimuli. 

Postulate 3. . Response inhibition 
and stimulus inhibition summate func- 
tionally to produce total inhibition 
(Ir) Implicit in the postulates are 
the following statements: 


1. Whenever a response is made 
and followed by reinforcement, only 
Ir will develop. 


8 Taking the modification of Hull's the- 
ory (as presented by Spence, Goodrich, & 
Ross, 1959) as a point of departure the 
above three inhibitory constructs could be 
incorporated as follows: 5 

In a simple two-alternative situation when 
one alternative is rewarded, the excitatory 
potentials of the responses to the two dis- 
criminanda (S,S-) are given by the follow- 
ing equations: 

E.,=H,.xX (D+K) —In 
E_=(H-XD) ~Ir 


where H=fNr (number of trials run), 
D=Td (time of food deprivation), K = 
fWg (magnitude of reward), Ir=fNr, Te 
=fn (number of nonreinforced trial runs), 
and Ip=In+ Is. 


202 MERRELL E. 


. 
7 


2. Whenever an organism is ex- 
posed to a stimulus in the absence of 
reward, only Ig will develop. This is 
based on the assumption that the or- 
ganism is not required to make a spe- 
cific response in arriving at the stim- 
ulus. 

3. Whenever a response is made 
and not followed by reinforcement, 
both Ir and Ig will develop. 

4, When responses are learned un- 
der conditions of partial reinforcement, 
both types of inhibition will develop, 
Ir on every trial and Js on the non- 
reinforced trials. 

The postulates yield a large number 
of deductions which are relevant to 
several experimental situations. Some 
of the more basic of these are formu- 
lated and discussed below. They are 
divided for convenience into two main 
categories: deductions relevant to 
spontaneous alternation, and deduc- 
tions relevant to extinction and related 
phenomena. The former are further 
divided into situations of constant stim- 
uli and situations of reversed stimuli. 


Depuctions RELEVANT TO 
SPONTANEOUS ALTERNATION 


In a simple two-alternative situa- 
tion, such as a T or Y maze, where 
the subject (S) is started from the 
same starting box on two consecutive 
trials, thus maintaining constant stim- 
uli, the following deductions apply: 

Deduction 1: Nonrewarded Ss will 
alternate more than rewarded Ss, 

Since only Jr is developed when a 
response is followed by reward, and 
both Ip and Is develop when a re- 
sponse is not rewarded, it would fol- 
low that the total amount of inhibition 
against repeating the turn and ap- 

proaching the stimulus would be 
greater in the latter case. A verifica- 
tion of this deduction was made by 
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Thompson* who found that nonre- 
warded Ss showed significantly more 
alternation (73%) than rewarded Ss 
(41%). 

Deduction 2: There will be a de- 
crease in alternation with an increase 
in intertrial interval. This decrease 
will be greater for rewarded Ss than 
for nonrewarded Ss. P 

The two-factor theory assumes that 
Ir dissipates much more rapidly than 
Ts. Therefore rewarded Ss will show 
greater decrease in alternation after a 
passage of time since all of the inhibi- 
tion will dissipate, whereas the non- 
rewarded Ss will lose their Ip but less 
of the Zs, thus showing less decrease in 
alternation after a passage of time. 

Solomon (1948) and Dember & 
Fowler (1958) cite many studies veri- 
fying the first part of this deduction. 
The second part of the deduction was 
verified by Thompson (see Footnote 4) 
who found that nonrewarded Ss showed 
a 16% decrease in alternation with 30- 
min, intertrial interval while rewarded 


4 Eighty rats were run on a cross-shaped 
rotating maze with two starting boxes in an 
attempt to study alternation as a function of 
three variables: (a) similar (black-black) 
vs. dissimilar (black-white) alleys and end 
boxes, (b) reinforcement in both end boxes 
vs. no reward in either end box, and (c) 
the first trial in each block of two being 
forced or free. By using a 2X 2X2 fac- 
torial design it was possible to determine 
the relative importance of each of the three 
main variables as well as possible inter- 
actions. 

Each S was given four trials, in blocks 
of two, each day for 12 days. The two 
trials within each block were given with a 
30-sec. intertrial interval and the interblock 
interval was 30 min. On half of the 24 
blocks the Ss were given their two trials 
from the same starting box, either Sı or Ss 
thus maintaining a constant position of maze 
arms. On the other 12 blocks, the Ss were 
started in the S, (or S2) box for trial one 
and then placed in the other box for trial 
two, thus reversing the relative position of 
the maze arms. 


a 
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Ss showed a 61% decrease with the 
same intertrial interval. 

In a simple two-alternative situa- 
tion, such as a cross-shaped maze, 
when the S is started from opposite 
starting boxes on two consecutive 
trials, thus reversing the stimuli, the 
following deductions apply : 

Deduction 3: Rewarded Ss will al- 
ternate turns and repeat stimuli. 

This deduction follows from the as- 
sumption that only /ę develops when 
responses are rewarded. Since no Iş 
is developed, there will be no tendency 
for the Ss to alternate stimuli. This 
implies that Hull’s reactive inhibition 
postulate is essentially correct if ap- 
plied only to situations in which both 
alternatives are rewarded. This de- 
duction has not been verified. 

Deduction 4: Nonrewarded Ss will 
alternate stimuli and repeat turns 
when the maze arms are dissimilar. 

The development of Jp would lead 
to the prediction that the Ss would 
alternate turns. However, Js will de- 
velop and accumulate and eventually 
overcome the effect of Jp; i.e., the total 
amount of /g after a few trials will be 
greater than the amount of Jp, which 
will dissipate between blocks of trials. 

Glanzer (1953b), using a black- 
white maze found 81% alternation of 
maze arms (stimuli) when his Ss were 
started from opposite starting boxes 
on two consecutive trials. Thompson 
(see Footnote 4) found nonrewarded 
Ss run on the black-white maze 
showed 60% alternation of stimuli. 

Deduction 5: Nonrewarded Ss will 
alternate turns and repeat stimuli 
when the arms are similar, 

Since Ig is assumed to generalize 
to similar stimuli, the Zs developed to 
Arm A will generalize to Arm B and 
since the amount of Js is approxi- 
mately equal for both of the alterna- 
tives, the Ss response will be deter- 
mined largely by Ir- Thompson (see 


Footnote 4) found verification of this 
deduction in the fact that nonrewarded 
Ss run on the black-black maze alter- 
nated turns 65% of the time as com- 
pared to only 40% turn alternation 
on the black-white maze. 

Deduction 6: Rewarded Ss will 
show a decrease in turn alternation 
and an increase in alternation of stim- 
uli with an increase in intertrial in- 
terval. 

The stimulus satiation theory makes 
the opposite prediction, i.e., a decrease 
in alternation of stimuliwith an increase 
in intertrial interval since Glanzer as- 
sumes that stimulus satiation effects 
dissipate with time. The two-factor 
theory, assuming that only Jp develops 
when the responses are reinforced, 
would predict a decrease in turn alter- 
nation as the Jp dissipates. Since the 
S can alternate turns or stimuli but 
not both, any decrease in turn alterna- 
tion must lead to an increase in the 
alternation of stimuli. Verification of 
this deduction was found in Thomp- 
son’s (see Footnote 4) study; the 
percentage of turn alternation for 30- 
sec., 30-min., and 24-hr. distribution 
was 40, 29, and 28 respectively. 


Depuctions RELEVANT TO 
EXTINCTION, REMINISCENCE, 
AND SPONTANEOUS RECOVERY 


Deduction 7: Extinction, as meas- 
ured by a decrease in spontaneous re- 
covery, can be strengthened by con- 
tinuing nonreinforced presentation of 
the CS after the CR has been extin- 
guished to a point of nonelicitation. 

This is sometimes referred to as 
below sero extinction and has been 
verified by Pavlov (1927) and Brog- 
den, Lipman, & Culler (1938). Ac- 
cording to the two-factor theory, both 
Ir and Is develop during the extinction 
trials up to the point of nonelicitation 
of the CR with Jr dissipating after 
this point has been reached. Contin- 
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ued exposure of the S to the stimulus 
(CS) in the absence of reward would 
produce still more s which leads to a 
predicted decrease in spontaneous re- 
covery. 

Deduction 8: After rats have been 
trained to go to one side of a T maze, 
the number of trials to extinguish the 
response may be reduced by placing 
the S in the previously rewarded goal 
box without reward, prior to extinc- 
tion proper. 

According to the two-factor theory, 
Ig develops when the S is exposed to 
the stimulus in the absence of reward 
(as during latent extinction). Thus 
the number of extinction trials re- 
quired are reduced. This deduction 
agrees with the experimental findings 
of Coate (1956), Deese (1951), 
Denny & Ratner (1959), Hurwitz 

(1955), Moltz (1955), Moltz & Maddi 

(1956), Seward & Levy (1949), and 

Thomas (1958). 

Deduction 9: Spontaneous recovery 
will be reduced when latent extinction 
precedes experimental extinction, pro- 
vided control and experimental Ss are 
given the same number of extinction 
trials. 

The amount of Ip developed during 
experimental extinction is approxi- 
mately the same for the two groups. 
However, the experimental group has 
more Iş, since some Iş was developed 
during the latent extinction period. 
Since Ig is assumed to dissipate at a 
very slow rate, the amount of sponta- 
neous recovery for the experimental 
group should be reduced. Thomas 
(1958) found no significant difference 
between groups in amount of recovery, 
but his experimental design was such 
as to offer no direct test of the above 
deduction. 

Deduction 10: When secondary re- 
inforcement is minimized, continuously 
reinforced Ss will show superior per- 

formance to partially reinforced Ss at 


THOMPSON 


all stages of acquisition when the two 
groups are compared in terms of num- 
ber of reinforced trials. The superi- 
ority of the 100% group will decrease 
with increase in intertrial interval. 

The habit strength of both groups 
is equal since they have had the same 
number of reinforced trials. However, 
the amount of total inhibition is greater 
for the partially reinforced group since 
the total number of trials exceeds that 
of the 100% group. With an increase 
in intertrial interval the J, dissipates 
for both groups, leaving the Js de- 
veloped on the nonreinforced trials for 
the partially reinforced group and thus 
decreasing the difference in perform- 
ance. Verification of this deduction is 
seen in a recent study by Reynolds 
(1958) who found that when perform- 
ance was compared in terms of num- 
ber of reinforced trials, the curve for 
the 100% reinforced group remained 
significantly above that for the 60% 
reinforced group. 

Deduction 11: The extinction curve 
of a continuously reinforced group will 
have a steeper initial slope than that of 
a partially reinforced group. 

This deduction is based on the as- 
sumption that Ig develops according 
to a negative growth function of num- 
ber of nonreinforced trials and dis- 
sipates very slowly. Since the par- 
tially reinforced group will have some 
Is developed during the nonreinforced 
acquisition trials, the increment of Is 
per trial will be smaller during the 
early extinction trials than for a con- 
tinuously reinforced group. Verifica- 
tion of Deduction 11 is found in many 
studies, among them James & Rotter 
(1958), Reynolds (1958), Sheffield 
(1949), Weinstock (1954, 1958), and 
Wilson, Weiss, & Amsel (1955). 

Deduction 12: Reminiscence will be 


complete for Ss trained under con- + 


tinuous reinforcements; i.e., following 
Test, the performance curve for a 
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massed group will be the same as the 
curve for a distributed group. 

Since only Jp is assumed to develop 
on reinforced trials, reminiscence 
should be complete as Ip dissipates 
rapidly with the passage of time. Sev- 
eral studies on motor learning (Adams 
& Reynolds, 1954; Archer, 1954; Bilo- 
deau, 1952; Schucker, Stevens, & El- 
lis, 1953; Starkweather & Duncan, 
1954) have failed to verify Kimble’s 
(1949) reported evidence for perma- 
nent work decrement on postrest per- 
formance following massed acquisi- 
tion trials. 

Deduction 13: Reminiscence will be 
incomplete for a long period of time 
following acquisition under partial re- 
inforcement conditions; i.e., following 
rest, the performance curve for a par- 
tially reinforced group will be below 
the curve for a continuously reinforced 
group, provided the number of rein- 
forced trials is held constant and sec- 
ondary reinforcement is minimized. 
The amount of this “permanent” work 
decrement will be proportional to the 
number of nonreinforced trials, 

This deduction is based on the as- 
sumption that Js developed on the non- 
reinforced trials will dissipate very 
slowly, thus prolonging complete re- 
covery of the performance curve. 

Deduction 14: Spontaneous recov- 
ery will be less for a group which 
learned the response under partial re- 
inforcement conditions than for a 
group which learned under continuous 
reinforcement, provided both groups 
are given the same number of extinc- 
tion trials. 

Total inhibition developed during 
extinction is the same for both groups, 
but the Zs resulting from the nonrein- 
forced acquisition trials of the partially 
reinforced group yields a total amount 
of Ig exceeding that of the continu- 
ously reinforced group and results in 
less spontaneous recovery. 
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The author knows of no studies di- 
rectly relevant to Deductions 13 and 
14. However, several studies testing 
these and other deductions relevant to 
reminiscence and spontaneous recoy- 
ery are underway in our laboratory. 


SUMMARY 


Because both the reactive inhibition 
and stimulus satiation explanations of 
spontaneous alternation seem inade- 
quate, a two-factor theory of inhibi- 
tion, in the form of three postulates, 
is presented. Deductions for the 
simple two-alternative situation, as 
well as extinction, reminiscence, and 
spontaneous recovery phenomena are 
listed, together with available support- 
ing evidence. 
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Hovering in the background of our 
Scientific enterprises is a question which 
we, as investigators, can afford neither, 
to raise very often nor to overlook 
entirely. Namely, do we have good 
reason to believe that the gerèral meth- 
ods and m a assumptions underly- 
ing our research can be counted on to 
lead in the long run to satisfactory 
interpretations of our phenomena? In 
the psychology of learning, there has 
been a division of labor, with the ex- 
perimentalists doing the overlooking, 
and each theorist having his turn at 
bringing up this question along with his 
answer. The answer is, of course, in 
each instance that the particular the- 
orist’s tactics point down the high road 
while those of the opposition lead into 
dark and forbidding culs-de-sac. 


1 This paper comprises, in substance, the 
writer’s Presidential Address to the Division 
of Experimental Psychology, American Psy- 
chological Assocation, 1959. Several indebt- 
ednesses deserve acknowledgment: an unre- 
stricted grant from the Ford Foundation 
supported portions of the experimental work; 
periods of relative freedom from academic 
routine in stimulating surroundings were 
made possible by the Department of Psychol- 
ogy, Northwestern University and the Insti- 
tute for Mathematical Studies in Social Sci- 
ences, Stanford University during the Spring 
and summer, respectively, of 1959 while the 
Paper was in preparation. 


For my own part, although my hands 
are not entirely clean in the matter of 
theorizing, I had been content until 
very recently to go along with the 
single-minded “learning experimental- 
ist,” assuming that the Lord will look 
after those who remember their control 
groups and mind their Ps and Fs. The 
incident which jarred me out of this 
comfortable way of life began as a 
simple (in fact, as will be seen, un- 
usually simple) bit of experimentation 
with no philosophical overtones. The 
original purpose of the experimentation 
was to provide more cogent empirical 
support than had hitherto been avail- 
able for one of the central concepts of 
learning theory, 

Several recent reviewers (e.g., Deese, 
1958; Estes, 1956; Kendler, 1959; 
Restle, 1959) have noted that despite 
the popular stereotype of “learning the- 
ory” as virtually synonymous with 
“controversy,” there has steadily, al- 
though unobtrusively, accumulated a 
body of concepts and assumptions which 
command relatively wide agreement and 
which contribute motivation and direc- 
tion to a great part of the research 
being done in the field of learning. At 
the center of this core of communality 
one finds a concept which represents the 
distillation of centuries of theorizing 
about learning, not to speak of 70-odd 
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years of experimentation in the tradi- 
tion of functionalism and, later, be- 
haviorism: the concept of associative 
strength. In the “mental mechanics” 
and “mental chemistry” of pre-experi- 
mental associationism, this concept was 
verbalized in terms of the strength of 
associations between ideas; with Thorn- 
dike, it became strength of stimulus- 
response bonds; with Hull, the basic 
quantitative constructs of habit strength 
and excitatory potential. And it is in 
terms of this concept that contemporary 
learning theorists express the basic dis- 
tinction between learning and perform- 
ance, as well as their fundamental pos- 
tulates relating the growth of habits to 
the number of training trials (“rein- 
forcements”’), 

There is little disagreement even in 
the precise quantitative form of these 
postulates, perhaps because our stand- 
ards of acceptability come directly from 
the observed forms of empirical curves 
relating probabilities of learned re- 
sponses, as well as measures of resist- 
ance to extinction or forgetting, to the 
number of training trials. Thus in a 
whole array of contemporary theories 
we find exactly the same equations ex- 
pressing the effect of a reinforced train- 
ing trial upon associative strength (cf. 
Restle, 1959). In Hull’s system, this 
equation is 


AH=hk(M—H) 


where A H represents the change in 
habit strength (H) on any reinforced 
trial, M is the maximum value of habit 
strength, and k is a constant. Spence’s 
(1955) revision of Hull’s theory is 
based on the same postulate, although 
there is some question as to whether the 
H in the equation should be replaced 
by E, representing excitatory Potential 
(Spence, 1955, 1958). Similarly, in 
statistical learning theory (Estes, 1959) 
the corresponding assumption has been 
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expressed in the form 
Ap=0(1—p) 


where A p represents the change in the 
proportion of stimulus elements con- 
nected to a given response, and @ is a 
constant (the proportion of stimulus 
elements sampled on the trial). In the 
stochastic model of Bush and Mosteller 
(1955), the same linear function is as- 
sumed, “strength” being simply iden- 
tified with response probability. For 
application to learning theorists, a well- 
known quotation may aptly be reversed 
to read, “In strength there is union.” 
And with respect to the experimental 
literature, it seems fair to say that the 
assumptions that associative strength 
increases with reinforcement, decreases 
during retention intervals, and general- 
izes to new stimuli are the cornerstones 
of most contemporary treatments of 
conditioning and elementary verbal 
learning. 

Areas of substantial agreement con- 
cerning either facts or interpretations, 
let alone both, are rare enough in the 
field of learning so that one might think 
we would do well to treasure the one 
we have located and carefully avoid 
doing anything to disturb it. This I 
(along with most of my fellow theorists 
and experimentalists) was happy to do, 
until my hand was forced by some 
purely experimental developments. The 
first of these was the, by now well- 
known, work of Kimble and his asso- 
ciates (1955, 1956; Dufort & Kimble, 
1958) who, working with an eyelid 
conditioning situation, tried the novel 
procedure of omitting the CS on a sub- 
stantial block of trials during an acqui- 
sition series. The rather surprising re- 
sult of this variation was that the course 
of acquisition was virtually unaffected. 
Their interpretation was that only the 
first few reinforcements are actually ef- 
fective in modifying the strength of 
associations between CS and CR, the 
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rest of the “conditioning curve” actually 
reflecting some nonassociative process. 
The second development was contrib- 
uted by Rock (1957), who found no 
retardation in the speed of paired-asso- 
ciate learning when he introduced the 
device of replacing missed items with 
new ones at the end of every trial. 
Rock interpreted his findings as indicat- 
ing that associations form on an all-or- 
none basis and that it is only associa- 
tive strength in the sense of resistance 
to forgetting that grows as a function of 
number of reinforcements. 

It seemed curious that both of these 
deviations from the usual experimental 
paradigms had yielded results which 
are in some respects sharply at variance 
with the generally accepted conception 
of the acquisition process. Of course 
these findings might turn out to have 
Special explanations that would leave 
the established conception undisturbed. 
But then again, they might not. In the 
case of the Kimble, Mann, and Dufort 
(1955) study, counterexplanations and 
counterexperiments have already begun 
to pour into the literature (see, e.g., 
Goodrich, Ross, & Wagner, 1957) 
with the result so far of complicating 
matters still further rather than of 
clearing anything up. One might argue 
that concepts and assumptions which 
have been supported by a large accum- 
ulation of experimental findings cannot 
be seriously threatened by one or two 
apparently negative results. But this 
argument is weakened by the observa- 
tion that all of the empirical support 
for the conceptualization of learning in 
terms of associative strength is quite 
indirect in character. 

The basic concepts and assumptions 
of learning theory are universally sup- 
posed to refer to states and processes of 
the individual organism. Yet the exist- 
ing evidence for the assumption that 
associative strength is an increasing 
function of number of reinforcements 
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comes from performance curves repre- 
senting average response measures over 
groups of learners, or from measures 
of resistance to extinction or retention 
scores averaged over groups of learners 
having different values of the independ- 
ent variable. Even the few bits of neg- 
ative evidence are indirect, depending 
on performance curves obtained under 
deviations from the usual experimen- 
tal paradigms but still representing 
changes in average scores over series 
of trials for groups of subjects (Ss). 
It would seem that if our basic con- 
ceptions are sound, it should be possible 
to cut through the web of group per- 
formance curves and obtain more direct 
and compelling evidence for the exist- 
ence of the assumed states and proc- 
esses in individual organisms. This, in 
any event, is what I set out to accom- 
plish for the concept of associative 
strength in a series of experiments now 
to be reported. 


ON THE DEFINITIONS OF “REINFORCE- 
MENT,” “TEST TRIAL,” AND 
“LEARNING” 


In standard human learning situa- 
tions, “learning” is almost universally 
defined and measured in terms of a 
change in the probability, or frequency, 
with which a given stimulating situa- 
tion evokes a response (or instances of 
a response class) that has been desig- 
nated as “correct” by the experimenter. 
With one reservation, to be noted be- 
low, I shall follow this usage. But the 
situation is quite different with rein- 
forcement,” the same term being used 
in at least two quite different senses by 
different investigators and thus promot- 
ing no end of confusion. My own 
habitual usage is the “neutral defini- 
tion” (Hilgard, 1956, p. 409) which 
identifies reinforcement empirically with 
the operation that is supplied by the 
experimenter in order to produce learn- 
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ing, as defined above, in any given sit- 
uation. In a paired-associate situation, 
the reinforcing operation is the paired 
presentation of the stimulus and Te- 
sponse members of an item; in classical 
conditioning it is the paired presenta- 
tion of CS and US; in verbal condition- 
ing, the reinforcing operation for a 
given predictive response (e.g., predict- 
ing that the left light will appear) is the 
occurrence of the corresponding event 
(appearance of the left light)—in each 


case without regard to whether the S ~ 


correctly anticipated the response mem- 
ber of the paired-associate item, gave a 
CR prior to occurrence of the US, or 
correctly predicted the event on the trial 
in question. The only property that 
different types of reinforcing operations 
are assumed to share is their common 
quantitative effect on the conditional 
probabilities of the possible alternative 
responses to the stimulating situation 
in which reinforcement occurs. 

A narrower definition, favored espe- 
cially by writers associated with a 
drive-reduction interpretation of rein- 
forcement, would limit the term rein- 
forcement to an operation that follows 
and is contingent upon the occurrence 
of the reinforced response on any trial. 
In this usage, reinforcement in paired- 
associate learning occurs only when the 
S has made a correct response in antic- 
ipation of the paired stimulus-response 
presentation, and reinforcement in ver- 
bal conditioning occurs only on trials 
when the S correctly predicts the trial 
outcome. Whether, according to this 
view, reinforcement occurs on only those 
trials of a classical conditioning experi- 
ment on which a CR occurs prior to the 
US depends upon theoretical decisions 
as to whether the CR and UR are “the 

same response” and whether reinforce- 
ment occurs at the onset or the termina- 
tion of the US. . 

The primary advantage I see in the 
“neutral definition” is that it can be 
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applied in an objective and consistent 
manner independently of one’s position 
on systematic or theoretical issues. 
Learning certainly may occur prior to 
the first correct anticipation in a paired- 
associate experiment, prior to the first 
correct prediction in verbal condition- 
ing, prior to the first CR in classical 
conditioning. The present usage per- 
mits us to speak, for example, about 
changes in probability of a response as 
a function of reinforcements on trials 
preceding its first occurrence, on the 
one hand, and changes as a function of 
reinforcements on trials including and 
following its first occurrence, on the 
other, without changing our definition 
of reinforcement. 

It should be emphasized that the 
neutral definition does not beg such 
questions as whether presentation of a 
US following a CR constitutes the same 
reinforcing operation as presentation of 
a US on a trial when the CR did not 
occur; these two procedures represent 
instances of the same reinforcing opera- 
tion if and only if they produce the 
same change in the probability of evoca- 
tion of the CR by the CS. However, 
it seems strategic to avoid issues of this 
sort when, as in the present investiga- 
tion, we are concerned with the nature 
of the changes in response tendencies 
during learning rather than with the 
conditions giving rise to these changes. 
Consequently, in the experiments to be 
reported, we have attempted so far as 
possible to avoid the customary con- 
founding of reinforcement with ante- 
cedent response. In paired-associate 
situations, for example, we have de- 
viated from the usual anticipation pro- 
cedure by separating the reinforcement 
(paired-presentation of stimulus and 
response members of an item) from the 
test for learning (presentation of the 
stimulus member alone) so that an item 
may receive more than one reinforce- 
ment before the first test trial or may 
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receive repeated test trials without in- 
tervening reinforcement. 

For purposes of measuring retention, 
it would be ideal if one could give test 
trials on which no learning at all oc- 
curred. Indeed, so long as “learning” 
is conceived solely in terms of the def- 
inition given above (increase in prob- 
ability of the “correct” response to a 
given stimulus), this goal is not too 
difficult to approximate. It seems in- 
tuitively clear, and can be demonstrated 
empirically (Estes, Hopkins, & Croth- 
ers, 1960), that no systematic increase 
in probability of correct responses to, 
Say, paired-associate items will occur 
Over a series of trials on which the 
stimulus members are presented alone 
and the S’s responses receive no re- 
ward or informational feedback from 
the experimenter. We cannot, how- 
ever, rule out the possibility that on 
these trials there might be learning in 
the sense of an increase in probability 
of whatever responses, correct or incor- 
Tect, actually occur. In fact, there is 
evidence that such learning does occur, 
but at a relatively low rate compared 
to the learning that occurs on rein- 
forced trials (Estes et al., 1960). Con- 
Sequently, in the analyses to follow, I 
shall assume that unreinforced trials 
can be treated, without serious error, 
simply as “neutral” test trials when 
primary interest is in measuring the 
effects of preceding reinforced trials. 


UNITARY ASSOCIATION vs, 
Hasir HIERARCHY 


If the stimulus-response relation es- 
tablished by reinforcement were a uni- 
tary, all-or-none connection of some 
sort, then the learning of a new re- 
Sponse to a stimulus would automat- 
ically displace a previously associated re- 
Sponse. Contrariwise, in every variant 
of the concept of associative strength, it 
is assumed that a number of different 
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responses may simultaneously be asso- 
ciated with the same stimulus, the rela- 
tive strengths of association depending 
primarily upon preceding frequencies 
of reinforcement of the different re- 
sponses in the presence of the stimulus. 
This latter assumption has been em- 
bodied in Hull’s “habit family hier- 
archy.” ; 
To spell out the question at issue in 
experimental terms, let us suppose that 
first Response A and subsequently Re- 
sponse B have been reinforced in the 
presence of a given stimulus (or stim- 
ulus complex—the nature of the stim- 
ulation does not matter so long as it is 
the same on each trial) for each mem- 
ber of a group of Ss and that a test trial 
now reveals an observed probability 
(relative frequency) of .5 for each re- 
sponse. Does this mean that for each 
S both Response A and Response B 
now have approximately equal habit 
strengths and therefore equal probabil- 
ities of evocation by the given stimulus? 
Such is the interpretation required by 
the conception of associative strength 
(habit strength, excitatory potential, 
proportion of conditioned elements, or 
whatever) as well as by any of the con- 
temporary models formulated in terms 
of continuously variable response prob- 
abilities for individual Ss. Stated in 
these concrete terms, however, it does 
not seem that the assumption need de- 
pend for support only on extremely 
indirect evidence, as has hitherto been 
the case. To test these implications of 
the strength concept quite directly, we 
apparently need only reinforce two ae 
ferent responses to the same stimulus 
for an individual S, and then, by ae 
of a series of unreinforced test trials, 
give the two responses an Opportunity 


to exhibit their relative strengths—if 
such exist, The predictions under test 


would be straightforward. On the hy- 
pothesis of associative strength, or habit 
family hierarchy, we expect individual 
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Ss to shift back and forth between the 

two previously reinforced responses 

over a series of tests. On the hypothe- 

sis of a unitary, all-or-none association 

between stimulus and response, we ex- 

pect that an individual S who makes 

Response A, say, on the first test trial 

with a given stimulus will not shift to 

B on subsequent tests. 

A paired-associate learning situation 
offered a number of convenient features 
for an empirical realization of the hypo- 
thetical experiment just described. The 
stimuli and reinforcements are readily 
controllable; and, by embedding a stim- 
ulus to which two different responses 
are to be reinforced in a conventional 
list, one can make S’s task appear little 
different from familiar learning situa- 
tions, For each of 20 Ss, the proce- 
dures were replicated with six eight- 
item lists, all made-up on the same 
principles. The stimuli were all con- 
sonant syllables and the responses one- 
syllable words. Half of the items in 
each list had single correct responses ; 
examples are: 


STIMULUS RESPONSE 
HTX wish 
JFR sped 


These were simply “ballast” and will not 
be considered further. The other half 
of the items had two correct responses 
for each stimulus; examples are: 


STIMULUS RESPONSES 
DGR thaw, weep 
BCG pink, rule 


The two correct responses to each stim- 
ulus were reinforced equally often for 
each S, By “reinforcement,” in this 
context, we mean simply a paired pres- 
entation of stimulus and correct re- 
sponse to S; by “test,” a presentation 
of the stimulus member of an item 

2This experiment was conducted at In- 
diana University with the assistance of E: J. 
Crothers. 
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alone. In half of the replications, there 
were exactly two reinforcements pre- 
ceding the test trials, one on each re- 
sponse; in the remaining replications, 
there were four reinforcements, two on 
each response, prior to the test trials. 
Following the reinforcements, a series 
of unreinforced test trials was given, 
the order of the stimuli being random- 
ized anew for each test. 

The critical data for our purposes are 
the proportions of cases in which the 
response given on the first test trial to 
a given stimulus was repeated on the 
second test. According to an interpre- 
tation in terms of associative strength, 
or habit family hierarchy, repetitions 
(AA or BB) and shifts (AB or BA) 
should have occurred with roughly 
equal frequencies.* According to the 
notion of a unitary association, AA and 
BB should have each occurred about 
half of the time and the shifts AB and 
BA not at all. The results were as 
follows: Following two reinforcements, 
there were 85% repetitions and 15% 
shifts on the first two test trials, the 
percentages being based on all items 
that had correct responses on both tests. 
Following four reinforcements, these 
values were 89% for repetitions and 
only 11% for shifts. On later pairs of 
tests, the frequencies of AB and BA 
shifts were even smaller. Thus the 
results do not offer very impressive 
support for the assumption that a habit 
hierarchy exists in the individual S fol- 
lowing the reinforcement of two differ- 
ent responses to the same stimulus. On 
the other hand, the data appear quite 
harmonious with an assumption of uni- 
tary association, since the proportions 
of AB and BA shifts appear small 

3 The response given on the first test trial 
was the more recently reinforced response 10 
approximately 60% of cases. 5 
stances of AA than BB would be expected ; 
nevertheless the proportions of repetitions an 
shifts expected on the associative strength 
hypothesis are nearly equal (.52:.48). 


Thus more in- 
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enough to be attributable to minor un- 
controlled factors such, for example, as 
fluctuations in context (background 
stimulation, stimulus traces from pre- 
ceding items) from one trial to the 
next.* 

Although the present experiment 
failed to produce the anticipated direct 
support for the concept of associative 
strength, a “strength theorist,” partic- 
ularly one working within the frame- 
work of Hull’s system, may not be too 
disturbed. With the benefit of an ad 
hoc assumption concerning the range of 
“behavioral oscillation,” the, model of 
Hull and Spence can be made to yield 
the prediction that, of two responses 
reinforced to the same stimulus, one or 
the other will dominate on test trials, 
thereby preventing shifts from one to 
the other on the part of individual SS; 
In order to obtain a more decisive test 
of the strength conception, we evidently 
require an even simpler experiment in 
which only one response is ever rein- 
forced, 


With the hope of forestalling an 
indefinite regress through a series of 
progressively more refined experiments, 


4 It should be noted that these results cast 
doubt on the concept of habit hierarchy only 
as applied to response probabilities in the 
presence of a stimulus situation that has been 
manipulated as a unit over the series of 
reinforced and test trials, M. S. Schoefiler 
(personal communication) and, in another 
unpublished study, Crothers and I have ob- 
tained positive evidence for the existence of a 
habit hierarchy associated with a compound 
stimulus, components of which have been 
separately correlated with reinforcements 
during the training trials. The results with 
compounding provide a control for the pres- 
ent experiment. If the very high proportions 
of repetitions reported above were attrib- 
utable to learning that occurred on the first 
test trial, or to some nonassociative variable, 
then: a similar excess of repetitions should 
have occurred over successive tests with 
stimulus compounds; however, no such o 
cess was observed either in Schoeffler’s study 
of Compounding or in ours. 
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suppose we ask what is the minimum 
set of operations and observations ac- 
tually needed in order to demonstrate 
learning, Normally there must be a 
pretest in order to determine the initial 
probability of the to-be-learned behav- 
ior in the test situation; in practice the 
experimenter often has a priori infor- 
mation about initial response probabil- 
ities which makes the pretest dispens- 
able. There must be a presentation of 
some reinforcing operation, and after- 
ward a test to assess the change in 
performance produced by the reinforce- 
ment. If the function of response oc- 
currences and nonoccurrences is to be 
determined, there will have to be a 
second test trial. And there we have it. 
Controlled comparisons relative to ef- 
fects of the principal events occurring 
during an acquisition series can, in 
principle, be accomplished in an experi- 
ment running to about a trial and a half. 
By usual standards, _this constitutes 
what can only be called a “miniature 
experiment.” However, miniature ex- 
periments appeared to be what the tac- 
tical situation called for, and therefore 
miniature experiments are what we set 
out to run. 


CoNCEPTIONS OF THE ACQUISITION 
Process: ASSOCIATIVE STRENGTH 
vs. ALt-Or-Nonr Mopets 


In the first of these experiments,® we 
used a paired-associate situation with 
consonant syllables as stimuli and num- 
bers as responses. Forty-eight Ss were 
Tun with an eight-item list, yielding 384 
observations on the first test trial. The 
principal portion of the experiment con- 
sisted simply in presenting each S once 
with each stimulus-response pair and 
then testing with each stimulus alone 
(in a new random order). Before pro- 

5 Thi aad was conducted at In- 
diane oo eee the assistance of B. L. 
Hopkins; for a full report of the method and 
results sce Estes et al. (1960). 
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Fic. 1. Schema representing effects of a 


single reinforcement according to incremen- 
tal (upper branch) vs. all-or-none (lower 
branch) theories. Squares represent Ss, 
with the proportion of darkened area in each 
indicating the probability of the correct re- 
sponse (C) for the given individual. 


ceeding to the results, let us examine 
the outcome expected on the basis of 
the notion of learning as a change in 
associative strength. In Figure 1 the 
situation is schematized in terms of a 
single item. The four squares at the 
left represent four hypothetical Ss, the 
emptiness of the squares indicating that 
all start the experiment with zero prob- 
abilities of making the correct response. 
Now we give a single reinforcement 
(paired presentation of the stimulus 
and correct response), the result of 
which is to raise the probability of the 
correct response (C) to, say, .25. The 
upper arrow leads to the theoretical 
state of affairs after this reinforcement, 
according to an interpretation based on 
the conception of associative strength. 
The strength of the association is in- 
creased for all of the Ss; and, neglect- 
ing for the moment possible individual 
differences, the probability of the cor- 
rect response is now .25 for each indi- 
vidual, the one at the upper right who 
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happened to make a correct response 
on the test, and the three who did not. 

Suppose now that the interpretation 
based on the concept of strength were 
completely wrong and that stimulus- 
response associations really formed on 
an all-or-none basis. Then the state of 
affairs after the reinforcement should 
be as shown in the lower part of the 
figure. Again the probability of a cor- 
rect response increases from zero to 25; 
but the .25 now refers only to the 
group, not to any individual S. One S 
has formed the association (darkened 
square), and three have been unaffected 
by the reinforcement (empty squares). 

To distinguish empirically between 
these two logically possible outcomes, 
we need only add the remaining half- 
trial to our trial-and-a-half, i.e., give 
another test without intervening rein- 
forcement. Now, if the upper branch 
of the diagram is essentially correct, all 
Ss should have equal probabilities of 
making the correct response on the 
second test trial, regardless of what 
they did on the first test. But if the 
lower branch is correct, correct re- 
sponses on the second test should come 
only from Ss who made correct re- 
sponses on the first test. None should 
come from Ss who made incorrect re- 
sponses (N) on the first test, for these 
Ss would not have profited at all from 
the learning trial. 

There might be some attenuation of 
the expected proportions of correct re- 
sponses on Test 2 by Ss making correct 
responses on Test 1 if there is any for- 
getting in this situation, but the pro- 
portions of correct following incorrect 
provide a critical comparison. If the 
all-or-none view is correct, then this 
proportion should be zero, or at least 
no greater than could be achieved by 
sheer guessing. But if any version of 
the strength conception is correct, then 
the proportion of correct following 10- 
correct responses should be greater than 
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chance. In order to make the outcomes 
that can be tolerated by the two inter- 
pretations sharply different, we need 
only choose our experimental materials 
and conditions so that the overall pro- 
portions correct on both first and sec- 
ond tests are well above chance. It can 
be seen in Figure 2 that this has been 
achieved, for approximately 50% of the 
items were correct on the first test and 
nearly 40% on the second. Consider- 
ing the critical lower branch of the 
diagram, leading from an incorrect re- 
sponse on the first test to a correct re- 
sponse on the second, we see that the 
results lean strongly in the direction 
prescribed by an all-or-none conception, 
for the 99% of correct following incor- 
rect responses is less than the 1214% 
that could be achieved even by rather 
unintelligent guessing with an eight- 
item list if the reinforcement had no 
effect at all on these items. The differ- 
ence between this value and the 71% 
of correct following correct responses is 
So large that a statistical test would be 
an empty formality. 

A possible defense that might be ad- 
vanced by a “strength theorist” is the 
hypothesis that the 51% of cases with 
incorrect responses on the first test 
simply represent preponderantly slower 
learners or more difficult items than the 
49% of cases with correct responses. 
If so, then a control condition in which 
a second reinforcement is given be- 
tween the first and second tests should 
yield a percentage of correct responses 
on Test 2 following incorrect on Test 1 
that is much smaller than the percen- 
tage correct on Test 1. This control 
was run (with the same 48 Ss but dif- 
ferent items), and the result is shown 
in Figure 3. The effect of the first re- 
inforcement on the full set of Ss and 
items was to raise the probability of a 
Correct response from near zero to .40; 
the effect of the second reinforcement 
On cases having incorrect responses on 
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the first test was to raise the probability 
of a correct response from near zero to 
.46. Thus there seems to be no support 
forthcoming for the hypothesis of a 
large difference in learning rate be- 
tween cases which did and cases which 
did not have correct responses on the 
first test. . 
Although it would be nice to claim 
credit for rare prescience in predicting 
the outcome of this little experiment, 
the fact is that the result came as a 
distinct jar to my preconceptions. In 
designing the study, our idea was not 
to undermine the strongly entrenched 
concept of associative strength, but to 
support it by showing that the results 
of Rock’s experiments, apparently call- 
ing for an all-or-none interpretation, 
must be attributed to some artifact con- 
cealed in his ingenious but somewhat 
complex procedures. Thus when Hop- 
kins and I examined the data from our 
initial group of 24 Ss and found the 
pattern shown in Figures 2 and 3, our 
first reaction was to replicate the whole 
thing with another group. But when 
the two replications turned out to agree 
in every essential respect, we were left 
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Fic. 2. Results of miniature Ne 
on acquisition of paired-associates. Empir 
cal values are proportions of instance in 
which correct (C) and incorrect (N) ma 
Sponses on first test trial after a single 
reinforcement (paired presentation of stim- 
ulus and response members) were followed 
by C and N responses on a second test trial. 
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a second reinforcement for cases not having 
correct responses on first test compared with 
proportion correct on first test for the full 
set of Ss and items. The Ss and situation 
are the same as those represented in Figure 2. 


with no obvious course but to begin 
digesting an unanticipated and not en- 
tirely palatable conclusion. The most 
cleanly controlled comparisons we had 
managed to devise yielded no evidence 
that repeated reinforcements in this sit- 
uation have any function other than to 
give repeated opportunities for the dis- 
continuous formation of a learned asso- 
ciation between observed stimuli and 
responses. 

Still, it is well known that theoretical 
doctrines do not yield readily to nega- 
tive evidence. One whose theories are 
based on a concept of strength will lose 
little ground if he can make a stand on 
the claim that all-or-none acquisition is 
simply a peculiarity of the paired-asso- 
ciate experiment and not characteristic 
of human learning in general. To eval- 
uate this possible defense of the strength 
concept, we clearly shall have to turn 
to some different situation that is quite 
different in the response mechanism 
and reinforcing operations from paired- 
associate learning. Eyelid conditioning 

meets these specifications, and it is 
convenient for our purposes since a 
colleague, I. Gormezano, has kindly 
made available his data from an in- 
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tensive period of data collecting in 
the Wisconsin conditioning laboratory. 
Gormezano trained a sufficiently large 
group, approximately 170 Ss, under 
identical conditions so that the first few 
acquisition trials can be treated as one 
of our miniature experiments and ana- 
lyzed in much the same way as the 
paired-associate study. 

The situation obtaining over the first 
couple of trials is schematized in Figure 
4. In the diagram, T, is the first CS 
presentation, prior to the first reinforce- 
ment, and we shall consider only Ss 
who made no CR on this test. Thus 
the initial probability of a CR is taken 
to be zero. Suppose now that the 
effect of the first reinforcement is to 
raise the probability of a CR to .25. 
According to the strength conception, 
shown in the upper panel, each S has 
his strength of conditioning increased 
by the same amount by this reinforce- 
ment- and now has probability .25 of 
making a CR. Then the second re- 
inforcement increases the conditioned 
strength for each S again; and, regard- 
less of whether or not a particular S 
happened to make a CR on T,, he now 
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Fic. 4. Schema for first two trials of eye- 
lid conditioning experiment showing chanse 
in CR probability (proportion of dake 
area in squares representing S's) prescribe 
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has a higher probability (%_ if we 
apply the linear function mentioned 
earlier).© According to an all-or-none 
conception, the situation after the first 
reinforcement, shown in the lower 
panel, is that for 14 of the Ss the CR 
has become associated with the CS and 
for the remaining 34 of the Ss no con- 
ditioning has occurred. The effect of 
the second reinforcement is to give the 
unconditioned Ss another chance, and 
Y, of these now become conditioned. 
The differential prediction, then, con- 
cerns the probability of a CR on the 
third test for Ss who made no CR on 
the second test (and similarly the prob- 
ability of a CR on the fourth test for Ss 
who made none on any previous test, 
and so on). The strength conception 
requires this conditional probability to 
increase, whereas the all-or-none con- 
ception requires it to remain constant. 
The test seems quite sharp, for even with 
allowance for variation in conditioning 
rates among Ss, a model which assumes 
that associative strength increases with 
reinforcements cannot stand constancy 
of this probability unless its assump- 
tions are so restricted that it reduces to 
an all-or-none model. 

The pertinent results of Gormezano’s 
study are shown in Figure 5, carried 
through the first four trials, beyond 
which the number of cases begins to 
drop off too much for comfort, In- 
specting the sequence of probabilities 
of CRs after 1, 2, or 3 consecutive 
NCRs—.13, .15, .14—we find the hy- 
pothesis of constancy appearing rather 
more attractive than the progressively 


6 For this example, the parameter 0 in the 
function Ap=e(1—>p) is equal to ⁄ and 
after the first experiment $ is also equal to 
M. Therefore we have 

Ap=\%(1— %) =e 
and for the new probability after the second 
reinforcement, 


p + Ap =Y + Ms =e 
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Fic. 5. Trial-by-trial acquisition data from 
Gormezano’s study of eyelid conditioning. 
Values of particular interest are the propor- 
tions of CRs following 1, 2, or 3 consecutive 
non-CRs, 


increasing trend required by the strength 
interpretation. (According to a linear 
model, for example, the value .15 for a 
CR after an NCR should have been .24, 
and the .14 for a CR after two. NCRs 
should have been .34.) 

The consistency of these conditioning 
data with those of the paired-associate 
situation is almost too good to be true. 
In psychology we are not used to hav- 
ing quantitative tests of alternative the- 
oretical notions yield such apparently 
decisive outcomes. Consequently, and 
considering the importance of the the- 
oretical issue, perhaps we will not yet 
be accused of beating a dead hypothesis 
if we look for one more test with ex- 
perimental arrangements differing from 
both of those preceding. We would 
like a situation similar to paired-asso- 
ciate learning in that unreinforced test 
trials can readily be given without dis- 
turbing the learners but one which elim- 
inates the possibility of achieving sub- 
stantial proportions of correct responses 
by guessing. A situation which meets 
these desiderata is the free verbal recall 
experiment used by Bruner, Miller, and 
Zimmerman (1955). For our present 
Purposes the minimal experiment will 
consist of a single reinforcement fol- 
lowed by two recall tests. The rein- 
forcement involves merely the experi- 
menter’s reading a list of words aloud 
to S. On a recall test, S is asked to 
write down as many words as he can 
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Fic. 6. Results of two miniature experi- 
ments on free verbal recall, showing near- 
zero proportions of correct responses on a 
second test trial for cases which did not have 
correct responses on the first test after a 
single reinforcement. 


remember (in any order). Then after 
an interval during which no additional 
reinforcement is given, S is (unex- 
pectedly) tested again. 

Results of two experiments? of this 
sort are shown in Figure 6. The upper 
tree represents an experiment with 35 
Ss, each given a list of eight words at 
R,. On the first test, T,, 61% of the 
280 opportunities for correct responses 
(C) yielded either incorrect responses 
or omissions (N), and of these less 
than 1% were followed by correct re- 
sponses on the second test, T,. In a 
replication conducted with some minor 
variations in procedure, 102 Ss were 
presented with eight words each on the 
reinforced trial. This time (lower tree 
in Figure 6) there were 72% N re- 
sponses on the first test and less than 
2% of these were followed by C re- 
sponses on the second test. Clearly, if 
a word is not given correctly on the 
first test by a particular S, the chances 
are virtually nil that it will be correct 
on a second test. 

7 These experiments, conducted at Indiana 


University with the assistance of Judith 
Crooks, will be reported in detail elsewhere. 
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This result does not, of course, 
prove that reinforcement has exerted no 
strengthening effect on the associations 
in the cases when the correct response 
failed to occur on the first test. But 
one whose theory requires him to as- 
sume that such strengthenings occur 
has a taxing assignment in producing a 
case for the existence of factors or proc- 
esses which appear in just sufficient 
force to cancel out the hypothesized 
increments in response strength under 
each set of experimental procedures we 
have examined. Explanations depend- 
ing on such factors as individual differ- 
ences are not very prepossessing in the 
light of control comparisons of the type 
exhibited in Figure 3 (a similar con- 
trol, with a similar result, was used for 
the free-verbal-recall situation). One 
might appeal to the effects of learning 
which occurs on the first test trial itself, 
arguing that an incorrect response which 
occurs on the first test receives a large 
increment in associative strength (from 
sheer contiguity or perhaps from some 
unspecified source of reinforcement) 
and therefore recurs with high prob- 
ability on the next test. One important 
difficulty with this hypothesis is that the 
data do not support it. In the paired- 
associate study cited above, for ex- 
ample, the observed relative frequency 
with which an incorrect response oc- 
curring on the first test was repeated 
on the second test was only .24. Inter- 
pretations which preserve the incremen- 
tal conception of associative learning 
should certainly be sought with all 
vigor; at the time of writing, however, 
none has come to my attention that 
seems at all plausible. 


RETENTION as A FUNCTION OF REIN- 
FORCEMENT AND INTERSPERSED 
Test TRIALS 

The story does not end here. Even 
if one is ready to grant that associations 
are made and broken in an all-or-none 


Pat 


LEARNING THEORY AND THE New “M 


fashion, this concession, although by no 
means a minor one, does not exhaust 
the resources of the strength concept. 
It is possible that after an association 
has once formed, associative strength in 
the sense of resistance to forgetting con- 
tinues to grow as a function of rein- 
forcements. In fact the experimental 
situations we have used appear well 
suited to demonstrate such an effect, 

It will not have escaped notice that 
although the three miniature experi- 
ments yielded sharply negative results 
relative to the conception of learning as 
a gradual strengthening of associations 
by reinforcement, they agree only in 
part with the usual notion of all-or- 
none learning (as, for example, defined 
by Voeks, 1955). The formation of 
learned associations appears to be dis- 
continuous rather than gradual, but 
Once a correct response (or conditioned 
response) has occurred to a given stim- 
ulus, it does not appear with certainty 
when the stimulus recurs after a rest 
interval. In fact, “forgetting” as meas- 
ured by the proportion of shifts from a 
correct response on the first test to non- 
correct on the second test ranges from 
about 20% in the paired-associate sit- 
uation to about 50% in free verbal 
recall. For the eyelid conditioning 
study, we did not report these propor- 
tions since the numbers of observations 
on which they were based were rather 
small; but for what it is worth, we 
might add that the proportions of CR 
to NCR shifts following the first three 
tests, despite the intervening reinforce- 
ment, fall well within the range given 
above for the other two experiments. 

The minimum set of operations and 
observations required to test for a de- 
pendence of “resistance to forgetting” 
upon number of reinforcements is ex- 
hibited in Table 1. Following either 
one or two reinforcements, a sequence 
of two unreinforced test trials is given ; 
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and retention is measured in terms of 
the proportion of instances in which a 
correct response to a given stimulus on 
the first of these tests is repeated on 
the second test. Results of two such 
experiments are shown in the table. 

The left-hand column of proportions 
represents data from a study conducted 
by the writer with the assistance of 
E. J. Crothers (Estes et al, 1960). 
Twenty Ss were each run on six eight- 
item paired-associate lists under each 
condition (number of reinforcements). 
The stimuli were nonsense syllables, 
and the responses were numbers in half 
the lists and familiar words in the other 
half. The right-hand column presents 
data from the free verbal recall experi- 
ment previously cited (102 Ss each 
tested with a list of eight items under 
each condition). The pattern of test 
proportions needs little comment. In 
each case the difference called for by 
the traditional conception of associative 
strength fails to appear. 

This outcome is a little hard to swal- 
low. It is well known that retention 
increases with overlearning. If the ad- 
ditional reinforcements given during the 
overlearning period do not produce the 
increased resistance to forgetting, then 
what does? We may obtain a clue as 
to the answer by introducing what 
might a priori seem to be very slight 
change in the design of our minimal 
experiment on retention. This variable 
is illustrated in Table 2, In the free 


TABLE 1 


DESIGN AND RESULTS oF MINIMAL EXPERI- 
MENT ON RETENTION AS A Fu ON 
or NUMBER OF REINFORCEMENTS 


Amount of Retention 


Procedure P-A FVR 
RT Tz -90 54 
RRM T2 .89 52 
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TABLE 2 


DESIGN AND RESULTS OF MINIMAL EXPERI- 
MENT ON RETENTION AS A FUNCTION 
or A Test TRIAL INTERPOSED 
BETWEEN REINFORCEMENTS 


Amount of 


Procedure Retention 
RR Tı Te -52 
RTRT: Ts -18 


verbal recall experiment cited previ- 
ously, we included, in addition to the 
condition shown in the upper row 
(which is the same as in Table 1), a 
condition with the same number of re- 
inforcements but with an additional 
interspersed test trial. The idea was 
to give an opportunity for correct re- 
sponses learned on the first reinforced 
trial to become conditioned to cues 
which are present only ‘on test trials 
and not on training trials—in other 
words to spread the effect of the first 
reinforcement over more stimuli. Re- 
tention is measured in terms of the pro- 
portion of correct responses repeated 
from T, to T, in the upper row and 
from T, to T, in the lower row. In 
contrast to the lack of effect on reten- 
tion produced by increasing the number 
of reinforcements, note the large effect 
of the added test trial. Two reinforce- 
ments without an intervening test yield 
only 52% retention, whereas two rein- 
forcements with an intervening test trial 
yield 78% retention from the first to 
the second test following the second 
reinforcement. 

Although the principle that retention 
increases with number of reinforce- 
ments is exceedingly well established in 
the lore of human learning, we must 
face the possibility that this empirical 
relation, like the classical acquisition 
curve, is an artifact of the confoundings 
inherent in the usual experimental par- 
adigm. 
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CONCLUDING REMARKS 


To recapitulate the box score: we 
have conducted a series of highly sim- 
plified experiments especially designed 
to provide relatively direct evidence for 
the widely accepted interpretation of 
learning and retention interms of incre- 
ments and decrements in associative 
strength. We noted that in virtually 
all contemporary learning theories, the 
concept of strength is assumed to have 
at least three different empirical man- 
ifestations—the habit hierarchy, the 
growth of associative strength in the 
sense of response probability as a func- 
tion of number of reinforcements, and 
the increase in resistance to forgetting 
of once established associations with 
additional reinforcement. We tested 
for all three of these effects by means 
of the simplest controlled comparisons 
we could arrange, doing our utmost to 
eliminate the confoundings and the lay- 
ers of statistical processing that shield 
the behavioral changes occurring on in- 
dividual learning trials completely from 
view in conventional experimental de- 
signs. And under these presumably 
favorable circumstances, all three ef- 
fects mysteriously evaporated, leaving a 
picture of unitary associations the learn- 
ing and unlearning of which proceed on 
an essentially all-or-none basis.$ 


8 Several readers of a prepublication draft 
of this article have raised the question 
whether alternative measures, e.g., response 
latencies or recognition scores, might yield 
evidence of learning in cases where the prob- 
abilities of conditioned responses or of cor- 
rect recalls do not. Regardless of the an- 
swer, it is important to note that information 
about concomitant changes in other variables 
would have no logical bearing upon conclu- 
sions pertaining to the one actually chosen 
for analysis in a given experiment. To 
determine whether the behavior of recalling 
the correct response to a paired-associate 
item is learned in all-or-none fashion, we 
required an analysis according to the par- 
adigm of Figure 2 with recall score as the 
dependent variable. To determine whether 
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What is the import of these results 
for the question raised earlier concern- 
ing the strategic soundness of the gen- 
eral methods on which investigators of 
learning chiefly rely? The concept we 
have examined in detail epitomizes the 
intervening-variable paradigm for the- 
ory construction that has been popular- 
ized by Tolman and Hull, and their 
followers, to the point of dominating 
contemporary learning theory. The 
general technique is to postulate a hypo- 
thetical state or entity which is held to 
intervene in some sense between ob- 
served stimulus and response variables, 
Predictions derived from theories built 
around such constructs are checked 
against data from standard learning ex- 
periments. Thus the adequacy of the 
theories depends in turn on the ade- 
quacy with which essential aspects of 
learning are captured by standard ex- 
perimental designs. 

‘I have emphasized the term “stand- 
ard” for despite the fact, decried by 
many critics, that psychologists in this 
field have resisted all admonitions to 
standardize their tasks, situations, and 
procedures, they have more than com- 
pensated for this lack by the degree to 
which they have standardized their ex- 
perimental designs. Scanning the ref- 
erence list of, say, Hilgard’s Theories 
of Learning (1956),° one can find no 


correct recognition of a previously viewed 
item is learned on an all-or-none basis, we 
would need a similar analysis with recogni- 
tion score as the dependent variable. To 
determine whether the behavioral change as- 
sociated with a decrease in latency is learned 
on an all-or-none basis, we would need a 
similar analysis with some criterion of change 
in latency as the dependent variable, and 
sO on. b 3 

9 The harvest is even scantier if one con- 
sults the more monographic works, e.g., Tol- 
man (1932), Hull (1943), or Spence (1955), 
ciated with systems of intervening-var- 
iables in behavior theory; or, for that matter, 
my own theoretical writings (Estes, 1959). 
Tn the eclectic treatise of McGeoch and Trion 


more than a meager handful of studies 
which are not molded into a paradigm 
that might be termed the groups-by- 
trials design. The master blueprint re- 
quires the investigator to average some 
type of performance score, e.g., fre- 
quency of conditioned responses or of 
correct responses, over a series of trials 
for groups of Ss—the groups being dif- 
ferentiated on the basis of conditions 
obtaining over the series. Going back 
a few years in the experimental jour- 
nals, one finds the customary output of 
this experimental paradigm to be a set 
of mean performance curves for experi- 
mental and control groups; more re- 
cently the output is typically an array 
of analysis of variance tables, perhaps 
supplemented by some mean criterion 
scores. These observations in them- 
selves do not constitute a criticism. No 
one would gainsay that the groups-by- 
trials design and its associated statis- 
tical techniques are useful tools for 
assaying the effects of various proce- 
dures and conditions upon performance. 
It is easy, however, to overlook the fact 
that the groups-by-trials design yields 
only information about relationships 
and trends which hold on the average 
over groups of Ss and series of trials. 
No accumulation of experiments, how- 
ever large, all conducted and all an- 
alyzed in accord with this same general 
method can provide a sufficient em- 
pirical check on concepts and assump- 
tions that refer to processes or events 
occurring in the individual learner. The 
findings we have considered in this 
paper suggest that in point of fact some 
of the most firmly entrenched concepts 
and principles of learning theory may 
be in a sense artifacts of a conventional- 


ized methodology- : 
The laboratory investigator of learn- 


s a O 
(1952), deviant experimental designs appear 
in references with the earliest dates but dis- 
appear as one comes down to thẹ contempo- 


rary literature, 
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ing is used to going his own way more 
or less oblivious to the rattle of criti- 
cisms from textbook writers, educators, 
and other “outsiders.” Traditionally 
the criticisms have always been to the 
same effect—that the “learning experi- 
mentalist” should push on from the 
worn out fields of conditioning and 
simple verbal learning, where there are 
really no unsolved problems of any im- 
portance remaining, and devote himself 
to richer, more complex, experiments 
that come closer to learning situations 
of real life (as contrasted with the un- 
real life found in the laboratory). Here 
we break with tradition, for the crit- 
icism generated by our work within the 
field suggests that progress toward a 
satisfactory theory of learning requires, 
not more complex, but simpler experi- 
ments. The conventional experiment, 
far from being oversimplified, repre- 
sents such a complex and intricate con- 
founding of stimulus and response var- 
iables over trials that once it has been 
done and reported in conventional form, 
no amount of study of the analysis of 
variance tables and Vincentized per- 
formance curves can disclose the effects 
exerted by specific causal variables on 
individual Ss upon particular occasions, 
Concepts depending solely upon the 
conventional experiment for support 
may turn out to belong, not to a psy- 
chology of learning, but only to a psy- 
chology of the criterion score, the mean 
performance curve, and the groups-by- 
trials design. 

In concluding, I would like to indi- 
cate that I do not mean to offer the 
experimental findings reported in this 
paper as a crucial test of incremental 
vs. all-or-none theory. What does seem 
clear with respect to the former is that 
the kind of evidence heretofore adduced 
in support of incremental theories is 
inadequate to distinguish them from al- 
ternative conceptions. More penetrat- 
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ing experimental analyses are required. 
We have made a start in this direction, 
and from the early returns it appears 
that no extant theory of the incremental 
type can handle the pattern of results 
that is emerging. The temptation is 
great to indulge now in a bit of specula- 
tion as to whether the answer to this 
situation will prove to be a remodeling 
of one of the familiar incremental the- 
ories, or perhaps a quite different the- 
ory based on all-or-none assumptions— 
but this is a step I do not intend to take. 

Since the earliest days of association- 
ism, overdependence upon speculation 
and circuitous inference has impeded 
the interplay of theory with experiment. 
While I would not for a moment de- 
preciate the role of imagination in sci- 
ence, I suspect that it will begin to 
serve us effectively in learning theory 
only as we begin to accumulate reliable 
determinations of the effects of single 
variables upon single learning trials in 
individual organisms. If by continual 
simplification of our experimental anal- 
yses and refinement of our mensura- 
tional procedures we can achieve these 
determinations, we may find that the 
long sought laws of association may 
be not merely “instigated,” or even 
“suggested,” but literally dictated in 
form by empirical data. 


REFERENCES 


Bruner, J. S., Mer, G. A., & ZIMMER- 
MAN, C.  Discriminatiye skill and dis- 
criminative matching in perceptual recog- 
nition. J. exp. Psychol., 1955, 49, 187-192. 

Busn, R. R., & MOosTELLER, F. Stochastic 
models for learning. New York: Wiley, 
1955. 


J. The psychology of learning. 
ed.) iene York: McGraw-Hill, 


Durort, R. H., & Kinse, G. A. Ready 
signals and the Ta of interpolated UCS 
presentations in eyelid conditioning. J: 
exp. Psychol., 1958, 56, 1-7. n 

Estes, W. K. Learning. Annu, Rev. Psy- 
chol., 1956, 7, 1-38. 


p 


LEARNING THEORY AND THE NEW “MENTAL CHEMISTRY” 223 


Estes, W. K. The statistical approach to 
learning theory. In S. Koch (Ed.), Psy- 
chology: A study of a science. Vol. = 
New York: McGraw-Hill, 1959, Pp. 380- 
491. 

Estes, W. K, Hopkins, B. L., & CROTHERS, 
E. J. All-or-none and conservation effects 
in the learning and retention of paired as- 
sociates. J. exp. Psychol., 1960, in press. 

Gooien, K. P., Ross, L. E., & WAGNER. 
A. R. Performance in eyelid conditioning 
following interpolated presentations of the 
UCS. J. exp. Psychol., 1957, 53, 214-17. 

Hincarp, E. R. Theories of learning. (Rev. 
ed.) New York: Appleton-Century- 
Crofts, 1956. 

Hutz, € L Principles of behavior. New 
York: Appleton-Century, 1943. 

Kenner H. H. Learning. Annu. Rev. 
Psychol., 1959, 10, 43-88. 

Kımsır, G. A., & Durort, R. H. The asso- 
ciative factor in eyelid conditioning. J. 
exp. Psychol., 1956, 52, 386-91. 

Kinair, G. A., Mann, L. I, & Durort, 
R. H. Classical and instrumental eyelid 


conditioning. J. exp. Psychol., 1955, 49, 
407-17. 


McGeocu, J. A., & Irox, A. L. The psy- 
chology of human learning. (Rev. ed.) 
New York: Longmans, 1952. 

Restle, F. A survey and classification of 
learning models. In R. R. Bush and 
W. K. Estes (Eds.), Studies in mathemat- 
ical learning theory. Stanford, Calif.: 
Stanford Univer. Press, 1959. i ; 

Rock, I. The role of repetition in associa- 
tive learning. Amer. J. Psychol., 1957, 
70, 186-193. 

Srence, K. A. Behavior theory and condi- 
tioning. New Haven: Yale Univer. Press, 
1955. 

Spence, K. A. Behavior the 
learning. In M. R. Jones 
symposium on motivation. 
ver. Nebraska Press, 1958, 

Torman, E. C. Purposive behavior in an- 
imals and men. New York: Appleton- 
Century, 1932. 

Vorxs, V. W. Gradual strengthening of 
S-R connections or increasing number of 
S-R connections. J. Psychol., 1955, 39, 
289-99. 


and selective 
), Nebraska 
Lincoln: Uni- 


(Received October 26, 1959) 


Psychological Review 
1960, Vol. 67, No. 4, 224-228 


ON PROBLEMS OF CONDITIONING DISCRIMINATED 
LEVER-PRESS AVOIDANCE RESPONSES * 


DONALD R. MEYER, CHUNGSOO CHO, ax» ANN F. WESEMANN 


Ohio State 


The learning of avoidance, as Solo- 
mon and Brush (1956) have re- 
marked, is something that we all take 
for granted. But, and they have put 
it particularly well, the 
failure of Ss to learn to avoid are not rare, 
but they are less apt to be reported in scien- 
tific papers than are the successes. And the 
conditions surrounding failures to learn to 
avoid are as instructive to the investigator, 
in many cases, as are the optimum conditions 
for producing the avoidance phenomenon. 


It is in this spirit that we offer 
certain data concerning failures to 
avoid. These have been collected in 
both formal and informal studies that 
have now occupied the better part of 
some two years. All have been con- 
ducted with rats, and all within the 
format provided by one or another 
leading brand of commercial lever- 
pressing apparatus. Some slight dif- 
ferences have been observed between 
the different brands, but we can con- 
clude at the outset that the problems 
herein described are not unique to any 
variant. Rather, the difficulties seem 
to be inherent in the lever-pressing 
situation, and not in different con- 
stellations as provided by the different 
fabricators. 


Tue Basic PROBLEM 


We must emphasize that such equip- 
ment can be used in training rats to 
make. stimulus-discriminated lever- 
press avoidance responses. Our con- 


1This research was supported in part by 
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cern is wholly with the rate at which 
such learning is established. We had 
first obtained this apparatus for some 
studies of the functions of the visual 
cortex; these investigations had been 
planned as extensions of the work of 
Meyer, Isaac, and Maher (1958). 
The bar-press situation was selected 
because we wanted to examine two 
responses that would be the same ex- 
cept for the incentive under which 
their learning was established. Fur- 
ther limitations thus imposed were 
two: that light be the warning stim- 
ulus, and that the habit be learnable 
within a single day. These did not, 
at first, appear to be unreasonable re- 
quirements. The primitive device that 
we had used before had worked well 
enough to give us learning in very 
rarely more than 100 light-shock pres- 
entations. It was our presumption 
that the lever-pressing habit could not 
be established quite so quickly, but we 
readily accepted this because of the 
convenience that the new equipment 
afforded. We were not, however, pre- 
pared in any way for our first em- 
pirical encounter, during which a rat 
failed to show reliable avoidance after 
several thousand trials. 

A few such experiences led us to 
conclude that we were doing some- 
thing very wrong; it did not seem pos- 
sible that lever-press conditioning 
could be as bad as this for the purpose. 
However, the first of our inquiries to 
persons familiar with the bar-press 
situation did not prove to be encourag- 
ing; we learned, for example, that it 
often takes as much as 40 hours to 
establish stable levels of performance. 
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Somewhat more salubrious estimates 
were nearer to 10, but it seemed that 
not all rats could be counted on even 
then. This perhaps should not have 
shocked us, but it did—for we were 
rémembering with sorrow that Mote 
(1940) had trained his average rat to 
turn a wheel within 50 light-shock 
presentations. The figure of merit for 
the lever-press response is, by this 
comparison, so poor that only in an 
automated era could it possibly survive 
as an indicant of choice. 

In retrospect, so far as our brain 
work was concerned, our course of 
action at this point should have been 
to build a Brogden-Culler apparatus 
and give up lever pressing forthwith. 
But it is most difficult to turn one’s 
back to a gleaming, thousand-dollar 
box; then too, we were still convinced 
that some combination of factors would 
produce efficient learning. Therefore 
We proceeded to assess the most ob- 
vious of these in an informal manner; 
bar weights, shock strengths, and light 
intensities were among the first con- 
siderations. Throughout these experi- 
ments, the general conditions were all 
very much the same: the warning stim- 
ulus consisted of a sharp illumination 
of the box; the box was darkened 
when the bar was pressed, and this re- 
sponse terminated shock on trials when 
the shock was not avoided. 

There was never any problem what- 
soever in training rats to make escape 
responses: indeed, after relatively few 
shaping trials—rarely more than 30 
or 40—the latencies on most trials 
were so short that it could be doubted 
that the shocks received were noxious. 
Frequently the rat would learn to 
hover just above the Jocus of the bar; 
this position was maintained through- 
out the presentation of the warning 
light. When the shock arrived, the 
rat depressed the bar very quickly, but 
in most instances it failed to learn to 


do this during the warning interval. 
To counter. this maneuver of “avoid- 
ance by escape” we next arranged a 
program such that shocks could al- 
ways be avoided, but that periodically 
were of a fixed duration rather than 
escapable. This of all the variables 
informally considered seemed to make 
at least some difference. 


Two ILLUSTRATIVE EXPERIMENTS 


Two formal studies serve to illus- 
trate just how difficult this problem is. 
We had hoped, because of certain 
problems of control, to train at least 
three rats a day; practically, the most 
that we could possibly invest in train- 
ing was a 4-hour session. We were 
therefore driven to consider massing 
as a possibility and hence trained 
three groups of nine rats each with 
intervals between trials of 20, 40, and 
60 seconds, respectively. Each trial 
consisted of light presentation for a 5- 
second period; pressing of the bar 
turned off the light and stopped or 
prevented the occurrence of a shock of 
1.6 milliamperes. The outcome, now 
so typical of many, was that the animals 
learned virtually nothing during their 
first training sessions, and this despite 
the fact that they had first been shaped 
to make escape responses and then had 
been presented with from 240 to 720 
light-shock pairings. ? 

Some small evidence of avoidance 
learning did appear when training was 
continued for a second daily sesston. 
The intermittent nonescape procedure 
then was instituted, and the rats were 
trained for three more days. But the 
final outcome, based upon a total of 
from 1,200 to 3,600 presentations, was 
that the 20- and 40-second groups still 
were not avoiding properly. For these 
groups, in fact, the median percent 
avoidance scores were .59 and 1.32. 
The 60-second group was somewhat 
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better: here at least some animals per- 
formed beyond a level of 50% avoid- 
ances. Only four, however, did so, 
and the median performance for the 
group was mere 6.69%. 

At this juncture, we began to specu- 
late that the problem might be in 
duration of the warning stimulus. This 
has been manipulated widely in dif- 
ferent experimental contexts; Warner 
(1932), for example, examined a range 
of from 1 to 30 seconds in a barrier 
situation. He found that learning was 
progressively faster as the warning 
interval decreased, but did not examine 
the range below 10 seconds in detail. 
In most relatively recent studies, these 
have been the intervals of interest, and 
5 seconds seemingly is most popular. 
Mote, on the other hand, presented 
light for just 2 seconds, and this in- 
terval was used successfully by Meyer, 
Isaac, and Maher. While we could 
not guess what the optimum would be 

in lever-pressing situations, it appeared 
to us that the best place to look was in 
the range below 5 seconds. 

For this experiment, the presenta- 
tions were given at a rate of one a 
minute; four groups of rats, 15 in a 
group, were trained for a total of six 
hours. There were two daily training 
sessions, each of which was three 
hours long; there were thus 180 pair- 
ings per day and 360 total pairings. 
We must say, perhaps, that these are 
few trials by operant conditioning 
standards; however, the number js 
seven times as large as that required 
for learning in a wheel. 

The differential treatments were 
confined to the first training session, 
Warning stimulus durations prior to 
shock were, for the four respective 
groups, 0, .5, 1.5 and 2.5 seconds, 
During the second session, this was 
fixed for all at a value of 5 seconds, 
This was done so that the superiority 

of one of the differential treatments 
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might be evidenced without the prej- 
udice of changing latency requirements. 
Here again, as in all the studies we 
have done, lever-pressing terminated 
light; if the animal did not avoid, light 
and shock went off together. 

These procedures yielded median 
performances approximating 20%; 
variations for the different intervals 
proved to be no more than 5%. But 
even these performances could not be 
taken as evidence for some avoidance 
learning; associated with them were 
bar-pressing rates that often reached 
extremely high values. There were 
many instances, in fact, of rates above 
two thousand per hour; had these 
rates been evenly distributed in time, 
at least a third of all the subjects 
would have shown us ultimately per- 
fect avoidance. We obtained, instead, 
a suppression of the rate when the 
warning stimulus came on; and this 
suppression was evidenced throughout 
the conduct of the study. 


INTERPRETATION 


Suppression, in itself, is not sur- 
prising. It is, indeed, the essence of 
the readily established CER of Hunt 
and Brady (1951); we expect it and 
we almost always find it regardless of 
our test technique. The question, 
rather, is why such suppression per- 
sists in lever-pressing situations, when 
its existence presumably implies a 
drive which lever pressing reduces. 
If light produces “fear,” and shock 
produces pressing, and pressing elimi- 
nates the light, how by any concept of 
avoidance learning can we fail to get 
avoidance ? 

We have found but one asylum, and 
that is in supposing that the offset of 
light is not an effective “fear-reducer. 
Having done so we can only wonder 
why we had expected it to be; in our- 
selves, we know that “fears” persist, 
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once evoked, far beyond their instiga- 
tors. We do not intend this statement 
as a glove to Mowrer and Lamoreaux 
(1951) however; all that they have 
said is that termination “will have the 
effect of reducing the rat’s fear at least 
a little and will thus reinforce, at least 
mildly . . .” the response which brings 
about the termination. Some rein- 
forcement is undoubtedly produced, 
and we would merely be inclined to 
substitute the word “trivially” for 
“mildly.” 

But why is the particular form of 
the response so important? Most ex- 
perimenters with whom we have dis- 
cussed this problem have considered it 
to be a matter of the relative availa- 
bility of the response to be conditioned. 
However, it is difficult to argue that 
responses sometimes made as many as 
20,000 times are lacking in availability. 
We believe, instead, that response to- 
pography is of consequence for other 
reasons and that we can best begin to 
show that this is so by first reviewing 
Schoenfeld’s (1950) concept of the 
process of avoidance. 

While recognizing that a light, such 
as we have used, becomes aversive 
after noxious pairings, Schoenfeld has 
emphasized the role of proprioception 
as linking avoidance to escape. In a 
sense, his view is that the rat, in avoid- 
ing, is escaping from its own response 
to the warning stimulus. Termination 
of the proprioceptive feedback is @ 
reinforcing operation, as is termination 
of the now-aversive warning stimulus 
itself. We believe that this approach 
is basically correct from the standpoint 
of its emphasis and that we can extend 
it to interpret the fact that apparatus 
can be ordered in efficiency for avoid- 
ance learning from wheel to shuttlebox 
to lever. ' s 

Briefly, the argument is this. Tf 
the termination of a Proprioceptive 
pattern has a reinforcing function, 


promptness and completeness will de- 
termine how effective this will be. 
Promptness and completeness will 
themselves depend upon our getting 
rid of the responses whose presence 
at the moment has yielded the propri- 
oceptive pattern to be terminated. 
Some responses will, of course, dis- 
appear upon removal of related stimuli, 
but others will persist unless inhibited 
through evocation of antagonists. The 
freezing-crouching pattern, or condi- 
tioned “fear,” is a member of this lat- 
ter group; thus the termination of its 
proprioceptive pattern will require re- 
sponse competition. 

Since the freezing-crouching pattern 
is a posture, suitable antagonists are 
few; in specifying these, we think we 
specify the quickly learned avoidance 
responses. These responses must, in 
scope, be commensurate with the pat- 
tern of conditioned fear, for when this 
is only partially the case the inhibition 
cannot be complete. The lever-press 
response thus is learned but slowly be- 
cause it is a poor antagonist; its oc- 
currence can be quite compatible with 
persistence of the prior pattern. But 
the situation is obviously different 
with a jump or running response; 
these yield us competition, termination 
—and rapid avoidance learning. 

Mowrer (1940), twenty years ago, 
remarked in passing that he had had 
no luck when he had tried to establish 
evidence of good avoidance learning 
with a “type of pedal-pressing Te- 
sponse.” With his conclusion we must 
heartily agree, and we think our tests 
have been exhaustive. However in- 
convenient the general implication, 
Operants are not arbitrary ; in avoid- 
ance learning, their selection is per- 
haps the most important of considera- 
tions. Possibly; a standard box can 
be so modified that suitable espone 
will appear; we ourselves are E 
this in a minor way with a “hit-the- 
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ceiling” pattern. Whether this will 
work as well as a shuttlebox is some- 
thing that remains to be seen, but 
never again will we regard such situa- 
tions as outmoded and inelegant sur- 
vivors of an era best forgotten. 
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There has been a growing research 
interest in originality during the last 
decade. Areas of major concern have 
been the study of the productivity of 
creative individuals (Dennis, 1958; 
Lehman, 1953; Taylor, 1956, 1958; 
Van Zelst & Kerr, 1951), their per- 
sonality (Barron, 1955; Drevdahl, 
1956; Taylor, 1956, 1958), the socio- 
logical and cultural factors promoting 
creativity (Barnett, 1953; Stein, 1953; 
Taylor, 1956, 1958), and investigations 
designed to develop reliable paper and 
pencil tests for the study of individual 
differences (Guilford, 1950; Spring- 
bett, Dark, & Clarke, 1957; Wilson, 
Guilford, & Christensen, 1953) .2 

Although there is general recogni- 
tion of the importance of the problem 
of devising training techniques for fa- 
cilitating creativity, most of the work 
in this important area has been anec- 
dotal or merely hortatory (Ghiselin, 
1955; Mearns, 1958; Osborn, 1957; 
Slosson & Downey, 1922). 


Scope oF Tuis PAPER 


The purpose of this paper is to re- 
view the experimental research that 
may be relevant to the problem of de- 
vising techniques for increasing origi- 
nality and to indicate the behavioral 
principles possibly involved in the pro- 
duction of the desired effect. 
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in Taylor (1956, 1958). 


For purposes of further discussion 
we shall distinguish between original- 
ity and creativity. Originality, or orig- 
inal thinking, as we shall use the term, 
refers to behavior which occurs rela- 
tively infrequently, is uncommon un- 
der given conditions, and is relevant 
to those conditions. Criteria of rele- 
vance and uncommonness, of course, 
must be established for any given situ- 
ation. This is a problem more readily 
solvable within the laboratory than 
without. Creativity, according to the 
present usage, refers to products of 
such behavior and the reactions of 
other members of a society to those 
products, An invention is a creative 
product that may have an important 
effect upon society and is a conse- 
quence of original behavior. But con- 
siderably more variables enter into the 
determination of creative works than 
originality alone. Our distinction im- 
plies that an individual may be highly 
original but not creative. His brilliant 
theories are never published, or they 
are not implemented by the necessary 
research. Perhaps the original re- 
search is conducted, but the results are 
not published or patented. All this 
may be done, but the work may be 
overlooked or the implications disre- 
garded by the appropriate community 
of scholars or society at large. Fur- 
ther common illustrations may be 
given, indicating that a great many 
more behavioral and societal variables 
influence creativity than originality, 
making the study of originality under 
simplified laboratory conditions more 
feasible than that of creativity. Many 
of these variables may affect original 
behavior as well, but they are not of 
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such overwhelming importance when 
the training of originality is consid- 
ered. Our subsequent discussion 
therefore will be concerned solely with 
originality as here defined. One fur- 
ther distinction in the experimental 
study of original behavior needs to be 
made. Such behavior is always rela- 
tive, either to a given individual’s past 
behavior or to the norms of a popula- 
tion of which he is a member (Barron, 
1955). The two are no doubt corre- 
lated. But it is the latter which is 
related to creative works; and since all 
of the pertinent experimental research 
has been of this kind, we shall be con- 
cerned only with the study of origi- 
nality relative to some group norm. 
Guilford (1959) also distinguishes 
between originality and creativity. 
However, the basis for the distinction 
stems from his factor analytic ap- 
proach. Originality, defined essen- 
tially in the same fashion as in the 
present paper, is one of several be- 
havioral traits contributing to creativ- 
ity. The latter is a more general be- 
havioral trait including as components 
several forms of flexibility, fluency, 
and motivational and temperamental 
traits in addition to originality. Al- 
though this is a reasonable empirical 
classification of traits based upon fac- 
tor analysis, it does not necessarily 
contradict the present one. We do 
not, however, accept Guilford’s addi- 
tional assertion that an-S-R approach 
cannot deal adequately with creativity. 


EARLIER ACCOUNTS OF ORIGINALITY 
TRAINING 


Although psychologists have done 
relatively little experimental research 
on the training of originality, non- 
psychologists have been concerned 
with the problem for some time, 
What is more, their suggestions, al- 
though not supported by evidence 
meeting the usual scientific standards, 


are frequently in accord with com- 
monly accepted behavioral principles. 

For example, Mearns (1958) in a 
book originally published in 1929 re- 
peatedly emphasizes that in order to 
facilitate the originality of school chil- 
dren in the arts the teacher must re- 
inforce, manifestly approve, his orig- 
inal efforts. He illustrates many 
techniques for inducing the child to 
make public his efforts, to gain the 
confidence of children so that com- 
munication will occur with their “sec- 
ret unexpressed selves.” Furthermore, 
the teacher is advised to approve of 
only the genuinely original effort. The 
teacher is further cautioned to wait 
patiently for the appearance of original 
behavior which is fostered by a “Der- 
missive atmosphere,” the absence of 
“drill” and excessive discipline. Ac- 
cording to Mearns, original behavior 
appears eventually because all normal 
children have an urge, energy, or im- 
pulse to be creative. 

Aside from the romantic approach 
to a problem of behavior, we would 
agree that the way to foster originality 
is to reinforce such behavior when it 
occurs. A basic difficulty is that it 
may not occur at all or at such in- 
frequent intervals that the reinforce- 
ments cannot shape up such behavior. 
Thus the fundamental problem in the 
training of originality is to devise 
methods for increasing its occurrence 
in diverse Situations, is to get it to 
occur in the first place, thereby per- 
mitting the Operation of reinforcement. 
If the operant level of originality 1s 
too low, conditioning will not be ef- 
fective, training cannot occur. We are 
assuming, then, that originality can be 
learned and that the same principles 
of conditioning hold as in other forms 
of operant behavior. Once it occurs 
it can be reinforced, approved, recog- 
nized. Tt will then show an increased 
probability of occurrence. An appar- 
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ent difficulty, however, is that origi- 
nality is manifestly different from 
other behaviors. How can the rein- 
forcement of one bit of uncommon 
behavior increase the frequency of 
other uncommon behaviors which by 
definition are different. Despite the 
apparent paradox, this sort of non- 
specific transfer does occur, as we 
shall see. 

Royce (1898), as far as we know, 
was the first to experimentally attack 
the problem of devising techniques for 
facilitating the occurrence of original- 
ity. He employed two different pro- 
cedures. First, Ss were asked to draw 
a series of nonsense figures unlike any- 
thing they had ever seen. A second 
procedure was to present a series of 
drawings to S with instructions to 
draw figures as different as possible 
from each of the models. Examination 
of the qualitative changes that oc- 
curred under these conditions led 
Royce to conclude that these proce- 
dures may facilitate originality and 
that they correspond to conditions in 
society that are conducive to original- 
ity. 

Slosson and Downey (1922) suggest 
another method for training and test- 
ing originality in writing fiction. They 
suggest writing a plot, characters, or 
an entire story, based upon the more 
unusual messages found in the per- 
sonal columns of newspapers. Head- 
lines are also proposed as training 
materials. Variations are possible in 
the use of the materials by placing 
restrictions on the kind of plot or 
characters to be invented. Thus, given 
an unusual message, the writer is 
forced to practice literary invention. 
The assumption is that this form of 
“mental gymnastics will increase the 
originality of his writing. 

A simple test of originality ae 
gested by Slosson and Downey (1922) 
is to determine how many different 


plots can be written to the same mes- 
sage, and for how many messages can 
plots be written in a given time in- 
terval. A more formalized test is also 
presented, but no empirical data are 
given from its use. Likewise, no data 
are presented on the effectiveness of 
the training method which appears to 
have been employed as a teaching tech- 
nique in creative writing courses at 
the college level. 

Osborn (1957) also believes that 
originality can be trained and that 
practice in producing original “ideas” 
will further develop originality. He 
gives many exercises designed to pro- 
vide such training. They typically 
consist of various kinds of problem 
situations for which as many different 
solutions as possible are to be offered. 
The quantity of different “ideas” is 
stressed, since it is believed that the 
likelihood of original “ideas” increases 
with increases in quantity. 

Osborn (1957) has also initiated 
a procedure of “group ideation” known 
as “brainstorming” which is designed 
to evoke original “ideas,” and which 
has received considerable attention 
from the public press. It is a modified 
group free association procedure with 
emphasis upon producing large num- 
bers of “ideas” in the absence of criti- 
cism or judgments as to their value. 
Critical evaluation of “brainstorming” 
sessions occurs later in order to avoid 
inhibiting unusual “ideas” during the 
session. 

Although there have been courses 
established to teach “brainstorming” 
in business and industrial firms as well 
as colleges, there is as yet little ob- 
jective research demonstrating its 
efficacy. 

Taylor, Berry, and Block (1957) 
found that group brainstorming did 
not produce a greater number of 
“ideas” or unique “ideas” as compared 
to individuals working alone and 
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whose performance was scored as a 
nominal group. In fact, it was found 
that the nominal groups were sig- 
nificantly superior to the brainstorm- 
ing groups’ under the conditions of 
their experiment. Corroborative evi- 
dence was found in another study by 
Taylor and Block (1957). These ex- 
periments, however, were not designed 
to study the effects of originality train- 
ing in groups or by individuals. They 
provide no information on the problem 
of training originality, whether giving 
many different solutions to problems 
results in an increased probability of 
giving original solutions to subse- 
quent different problems, 

Meadows and Parnes (1959) have 
recently reported that a course based 
upon the principles described in Os- 
born’s book (1957) produced a sig- 
nificant increment in originality in its 
students as compared to a control class 
taking a different course. But as in 
many classroom experiments, there is 
the difficulty of assessing what the 
relevant variables are, whether it was 
the training in problem solving per 
se, or the changes in motivation result- 
ing from participating in a class and 
studying a procedure which explicitly 
claims to facilitate problem solving, 
There are additional difficulties inher- 
ent in the study since no attempt was 
made to control or assess the extent of 
differential rehearsal of the test mate- 
rials in experimental and control 
classes. The experimental class prac- 
ticed on materials very similar to the 
originality tests given before and after 
the courses were taken. Finally, the 
design employed permits cs biased 
sampling of Ss, since the “creative 

problem solving” course was an elec- 
tive one. f 

The early psychological literature as 
well as the suggestions of the non- 
psychologists reviewed tend to agree 
upon a small number of different pro- 


cedures for increasing originality. 
One training procedure is to present 
an uncommon stimulus situation, a sit- 
uation for which common or conven- 
tional responses may not be readily 
available. Relatively uncommon re- 
sponses may be evoked as a conse- 
quence (Slosson & Downey, 1922; Os- 
born, 1957). A second procedure is 
the evocation of different responses to 
the same stimulus situation (Royce, 
1898; Osborn, 1957). Under such 
conditions the successive responses 
may become more uncommon. A third 
training procedure is the evocation of 
the uncommon responses as textual 
responses. Several recent experiments 
have employed this procedure (Jud- 
son, Cofer, & Gelfand, 1956; Maltz- 
man, Brooks, Bogartz, & Summers, 
1958; Maltzman, Simon, Raskin, & 
Licht, 1960) and will þe discussed 
shortly. Instructions, as distinguished 
from training, may also be used to in- 
crease originality, and appear to be 
effective under certain conditions 
(Christensen, Guilford, & Wilson, 
1957; Maltzman, Bogartz, & Breger, 
1958; Royce, 1898), 


“PRODUCTIVE” THINKING AND 
ORIGINALITY 


Before turning to the current ex- 
perimental research on originality 
training, however, note should be taken 
of several recent studies that have em- 
ployed Maier’s (1931) two-string or 
analogous problems as a test situation. 
The reason for commenting upon these 
Studies is the assumption that there 
is no fundamental difference in the be- 
havioral principles determining origi- 
nality and problem solving behavior 
generally. Both involve the evocation 
of relatively uncommon responses, 
otherwise the situation would not be 
called a problem or the behavior orig- 
inal. This is especially apparent im 
connection with Maier’s two-string 
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problem. Maier (1931) considers the 
pendulum solution to be a consequence 
of productive thinking whereas other 
solutions represent reproductive think- 
ing. As indicated elsewhere (Maltz- 
man, 1955) these problem solving be- 
haviors do not involve fundamentally 
different laws as demanded by Maier, 
but only a difference in the extent to 
which initial response hierarchies are 
modified. Furthermore, as far as can 
be determined this also is the basis 
that Maier used for distinguishing re- 
productive from productive solutions. 
The former solutions were given by 
his Ss upon the initial presentation of 
the problem. Only after these solu- 
tions were given and the Ss were told 
to find still another solution, which 
happened to be the pendulum solution, 
did Maier call it productive thinking. 
The empirical basis for the distinction 
between reproductive and productive 
thinking, then, was the probability of 
occurrence of the solution under his 
experimental conditions. The pendu- 
lum solution was relatively uncommon 
and is original in this sense. 

Several studies have attempted to 
determine, with some success, whether 
verbal responses necessary for the solu- 
tion of a subsequent problem are pres- 
ent in the initial response hierarchy 
of Ss (Saugstad, 1955; Saugstad & 
Raaheim, 1957; Staats, 1957). The 
determination is made by obtaining 
word associations to objects later 
found in the problem situation. Re- 
sponses necessary for the solution of 
the problem are presumably uncom- 
mon, have a low probability of occur- 
rence in the initial response hierarchy. 
An association procedure has also been 
used to determine the uncommonness 
ked in the initial re- 
sponse hierarchy and their ane ue 
subsequent success in the So ee 
problem (Maltzman, Brooks, Bogartz, 


& Summers, 1958). 


of responses evo 


A series of experiments by Judson, 
Cofer, and Gelfand (1956) is a limit- 
ing case of the use of textual responses 
in facilitating “productive” problem 
solving or originality. They employed 
the two-string problem as the test and 
rote learning as the training situation. 
Instead of attempting to increase the 
occurrence of uncommon responses 
during training and determining their 
transfer effects in the test situation, 
they induced directly the response 
chain necessary for a pendulum solu- 
tion to the problem. A group form 
of the two-string problem was em- 
ployed, and the Ss were required to 
write as many solutions as possible to 
the problem of grasping both strings 
simultaneously. The design of the ex- 
periment was to prompt a pendulum 
solution to the problem by presenting 
an experimental group with different 
lists of words, one of which contained 
the relevant response chain “rope,” 
“swing,” “pendulum.” Additional con- 
trol groups were employed receiving 
word lists including none of these 
words, one or all three but each in a 
different list. In this study as well as 
in two replications, the experimental 
group was superior to the control 
groups, although the differences were 
not always statistically significant. A 
further qualification is that these re- 
sults hold only for men. No reliable 
differences were obtained among 
women in the various conditions, An 
analogous experiment was conducted 
employing a group form of Maier’s 
hat rack problem, and comparable re- 
sults were obtained. These studies 
suggest that repeated evocation of the 
critical words during rote learning in- 
creased the probability of their occur- 
rence in the problem situation which 
in turn lead to an increased frequency 
of the desired problem solution. 

Another related study has been re- 
ported by Flavell, Cooper, and Lois- 
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elle (1958). Their study was designed 

to demonstrate that “functional fixed- 
ness” could be reduced by evoking un- 
common responses to an object during 
a pretest period. In one respect it is 
more pertinent to the problem of pro- 
ducing relatively nonspecific transfer 
than the studies by Judson, Cofer, and 
Gelfand (1956), because the responses 
evoked during the pretest period were 
not the appropriate ones in the subse- 
quent test problem. However, for our 
purposes the experiment suffers the 
limitation that it does not show an ac- 
tual facilitation of problem solving 
performance but only a reduction in 
“functional fixedness.” 

A series of studies by Maltzman, 
Brooks, Bogartz, and Summers 
(1958) combines features of both of 
these experiments (Judson et al., 
1956; Flavell et al., 1958) by evoking 
uncommon textual responses during a 
pretest period which are not involved 
in the problem solution in any obvious 
manner. Evidence of facilitation was 
obtained under these conditions. A 
screwdriver and a block of balsa wood 
were the objects available for use in 
the problem situation. Only the 
screwdriver was sufficiently heavy to 
serve as a weight permitting a string 
to swing in pendulum fashion. In the 
first experiment a control group was 
given the test problem without any 
pretraining. Lists of unusual uses for 
a screwdriver, balsa wood, and string 
were read by the experimental group 
before they performed in the problem 
situation. None of the uses referred 
to operations involved in the experi- 
ment. It was nevertheless found that 
the experimental group solved the 
problem significantly faster than the 
control group, and these results held 
for both sex groups. Additional ex- 
periments with modifications in the 
lists of unusual uses also obtained re- 
liable differences between experimental 


and control Ss, but typically in the 
case of women and not the men. 
Many variables in this experimental 
situation are in need of further analy- 
sis. However, the inherent lack of re- 
liability of the two-string problem as 
a test and the difficulties involved in 
obtaining experimental control over 
the relevant variables in the situation 
suggest that more fruitful studies of 
originality and problem solving gen- 
erally should employ another type of 
experimental situation. Also, with re- 
spect to the study of originality, test 
problems of this sort usually involve 
uncommonness of response only by in- 
ference or by definition. A more di- 
rect measure of originality would be 
highly desirable. 


Current STUDIES OF ORIGINALITY 
TRAINING 


Because they seemed to meet the 
desired requirements, tests of original- 
ity of the kind developed by Guilford 
(1950) and his associates (Wilson, 
Guilford, & Christensen, 1953) as well 
as earlier writers (Hargreaves, 1927; 
Slosson & Downey, 1922) were 
chosen for a more detailed experi- 
mental analysis of variables involved 
in training originality by Maltzman 
and his associates (1958, 1960). 

In the first experiment in this series 
(Maltzman, Bogartz, & Breger, 1958) 
the Ss were run individually. All of 
the subsequent studies were conducted 
in group form. A modified free asso- 
ciation procedure was employed in 
which the stimulus words were Se- 
lected on the basis of their restricted 
response hierarchies. Since a rela- 
tively small number of different re- 
sponses are evoked by the stimuli, a 
high degree of communality occurs 
among their responses. A control 
group received an initial list of 25 
words to which they gave free asso- 
ciations and a different final list of 25 
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words to which they gave free asso- 
ciations. This was followed by the 
Unusual Uses Test of originality 
(Guilford, 1950). The latter test con- 
sists of a series of names of six com- 
mon objects for which S must give 
different uses other than their common 
everyday use. 

An experimental group received the 
same treatment except that the initial 
list was presented five additional times 
with instructions to give a different re- 
sponse on each repetition. A second 
experimental group was treated in the 
same fashion except that Æ said 
“good” after every approximately 
fifth uncommon response during train- 
ing. Each of these three groups was 
subdivided prior to the introduction of 
the free association test list. Half the 
Ss in each condition were instructed 
to try to be as original as possible, and 
half were told nothing. Each of these 
subgroups were further subdivided 
prior to administration of the Unusual 
Uses Test and instructed to be as 
Original as possible. These latter in- 
Structions may have been redundant, 
if not confusing, since the written in- 
structions to the test already implied 
as much, 

Originality measures were obtained 
for performance on the free associa- 
tion list by determining the frequency 
with which the responses to each stim- 
ulus word occurred. The mean fre- 
quency of his associative responses was 
taken as each S’s originality score. 
Thus the lower the score the more 
uncommon are S’s responses on the 
average. The total number of unique 
responses given to the items v vas taken 
as the measure of originality on the 
Unusual Uses Test, where a unique Te- 
sponse was defined as a use occurring 
only once in the sample. Results of 
the experiment showed that both train- 
ing and instructions produced a sig- 
nificant increase in originality on the 
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free association test list. What is 
more, the experimental subgroups re- 
ceiving training in addition to instruc- 
tions were significantly more original 
than the control group receiving in- 
structions alone, indicating that the 
training may produce its effects inde- 
pendently of instructions to be original. 
Results obtained from the Unusual 
Uses Test were ambiguous, yielding 
a significant triple order interaction 
where the experimental subgroup 
without originality instructions was 
significantly more original than its con- 
trol group. However, results on the 
Unusual Uses Test from subsequent 
experiments have consistently shown 
reliable training effects. 

Despite the above evidence indicat- 
ing that the training procedure facili- 
tated originality, additional informa- 
tion is needed to determine precisely 
what characteristics of the procedure 
are responsible for the effect. Differ- 
ences between the experimental and 
control groups may have been due to 
the difference in number of stimulus 
words presented, number of responses 
evoked, number of different responses 
evoked, ete. The fact that different 
responses were evoked by the same 
stimuli may be neither necessary nor 
sufficient for an effective training 
method. Additional research was 
therefore conducted in an attempt to 
isolate the relevant variables. 

In one study (Maltzman et al., 
1960) the control condition and the 
experimental condition without verbal 
reinforcement or instructions to be 
original were again employed. The 
latter group received instructions to 
give a different response to each pres- 
entation of the same stimulus word. 
For convenience they will be called the 
standard control and experimental 
conditions, respectively. Two addi- 
tional experimental groups received 
lists of 125 different stimulus words, 


236 


the same number as the repetitions of ' 


the initial list for the standard experi- 
mental group. One group received 
high frequency count stimulus words 
while the other group was ian 
with relatively low requency coun 
words (Thorndike & Lorge, 1944). 
This procedure equated the experi- 
mental groups for the total number of 
different responses. They differed in 
that the responses all occurred to dif- 
ferent stimulus words in the new ex- 
perimental conditions whereas differ- 
ent responses occurre 
stimulus words in the standard experi- 
mental condition. Another control 
group was added which received the 
same number of repetitions of the ini- 
tial list as did the standard experi- 
mental group, but these Ss were in- 
structed to try to give the same re- 
sponse at each repetition of a given 
stimulus. This condition would aid 
in determining whether the number of 
Tesponses per se, or the number of dif- 
ferent responses, is the relevant vari- 
able influencing test performance, 
Results from the free association 
test of originality showed that the three 
experimental gtoups did not differ sig- 
nificantly from each other and the two 
control groups did not differ signifi- 
cantly. But each of the experimental 
groups was significantly more original 
than each of the control groups. On 
the Unusual Uses Test the standard 
experimental group did significantly 
better than all the other groups. The 
two experimental groups receiving 
different stimulus words during train. 
ing did not differ from the standard 
control group, while the new control- 
condition was significantly less original 
than each of the other conditions, 
The results from this experiment in- 
dicate that under some conditions the 
evocation of à relatively large number 
of different řesponses to different stim- 
uli may increase originality to the same 


d to the same 
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extent as the method of repeated evo- 
cation of different responses to the 
same stimuli. However, the two 
methods apparently differ in the ex- 
tent to which they produce transfer 
effects, as indicated by the significant 


difference between the two methods on , 


the Unusual Uses Test. These results 
show that the evocation of different 


wi aa BRN 
responses to the same stimuli is an im- 


portant aspect of the standard experi- 
mental procedure, although arousal of 
many different responses by different 
stimuli is also effective, within limits. 
The significant loss of originality by 
the new control group was not entirely 
unexpected, and shows that a training 


the preceding free association test is 
1S a problem that re- 
© experimental investiga- 


Another experiment (Maltzman et 
al., 1960) was designed to further 
explore possible relevant variables and 
alternative Procedures for facilitating 


originality, 

The standard control and experi- 
mental conditions were again em- 
ployed. Another experin 


nental con- 
dition was given Tepeated presentation 
of items from the Unusual Uses Test 
in order to determine whether the ef- 
fectiveness of the training procedure 
is limited to only certain kinds of ma- 
terials, This was accomplished by 
Presenting the list of six items a total 
of six times With instructions to give a 
different use prior to each repetition. 
A different kind of training procedure 
was employed with a third experi- 
mental group. They were presented 
with a booklet consisting of 125 dif- 
ferent Pairs of words, The Ss were 


aS) 


; 
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instructed to indicate the member of 
ach pair which goes more readily 
with the stimulus word orally pre- 
sented to them. The same stimulus 
words and number of ‘repetitions as 
employed with the standard experi- 
mental conditions were administered. 
Each pair of words were unique re- 
sponses to the stimulus words selected 
from the previously established norms. 
A fourth experimental group was 


“given the word pairs and asked to 


indicate which member of each pair 
they thought to be more familiar, The 
stimulus words were not presented. 
Results of the free association and Un- 
usual Uses Tests showed that the 


standard experimental condition was | 


significantly more original than the 
other four groups on both tests, while 
the other groups did not differ sig- 


nificantly from each other, 


In one of the new conditions uncom- 
mon responses were evoked as textual 
responses following the stimulus 
words. - In the other condition only 
the textual responses occurred. These 


conditions were introduced in order to - 


determine whether the uncommon re- 
Sponses must be evoked by the stimu- 
lus words in order for the training to 
be’ effective. The fact that these 
groups were significantly less original 
than the standard experimental group 
Suggests that, at least with the training 
and test materials employed, uncom- 
mon responses must be evoked as in- 
traverbal responses by the stimulus 
words if there is to be an increase in 
originality in the test situations, 

Since the experimental condition 
which received original training with 
items from the Unusual Uses Test 
failed to show a significant training 
effect, another experiment was con. 
ducted to determine whether the failure 
Was due to an inadequate amount of 
training. Four conditions were em- 
Ployed in this study receiving different 
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amounts of training. None of the 
groups showed a significant increase 
in originality on the free association 
test when compared with the standard 
control condition. y 

Results from the latter experiment 
and one condition from the previous 
experiment showed that training with 
unusual uses items in the same fashion 
as word association materials does not 
produce a comparable increase in orig- 
inality. Thus the method of repeated 
evocation of different responses to the 
same stimuli does not work with all 
materials. 

A basic assumption of our approach 
is that, if original behavior can be in- 
duced to occur repeatedly through 
some training procedure, it will in- 
crease in its frequency of occurrence 
in new situations. If this condition 
is not met, transfer of originality to 
new situations cannot occur. Inspec- 
tion of the unusual uses training data 
indicated that an appreciable increase 
in originality did not occur during 
training. In the case of training with 
free association materials a marked in- 
crease does take place. 

Another experiment was conducted 
in which different groups received 0, 
1, 5, or 10 repetitions of the initial 
word association list before testing on 
another word association list and the 
unusual uses. A significant trend 
was evidenced on both originality tests, 
with a single repetition of the tee 
list producing a significant nt). 
originality (Maltzman et al., 19 hae 

On the basis of the current dearth o 
relevant information, we can only Be 
ulate as to why training with ee 
uses failed to induce an Imcerease z 
the uncommonness of the ed i 
One hypothesis that may beg a 
based upon the apparent relativ A ni 
i f different uses availa 
d nue a Given the instructions 
< 2 different use to each repeti- 
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tion of an item and the inability of Ss 
to do so with a different operant, they 
respond with essentially synonymous 
uses repeatedly, a condition, as we 
have already seen, not likely to facili- 
tate originality. cw 
If, as we have assumed, originality 
can be learned according to the prin- 
ciples of operant conditioning, then 
the effects of originality 
should be of the same kind 
ordinarily obtained in experimental 
studies of learning. One of the gross 
characteristics of learning is that the 
behavioral changes produced by rein- 
forcement persist for some time, are 
not relatively transistory. Increased 
originality induced by the training 
method previously described should 
therefore also persist. An experiment 
designed to investigate this property 
of originality training has been con- 
ducted, and positive results have been 
obtained (Maltzman et al, 1960). 
Using the standard control and experi- 
mental conditions of the previous ex- 
periments, a delay of approximately 
one hour was interposed between the 
training and test situations for a con- 
trol and an experimental group, and 
a delay of two days for another con- 
trol and experimental group. 
nificant training effect w. 
the word association an 
Uses Test of originality. 
These results show 
of originality training 
nificant degree for 
under the given e 
tions. We may conclude from these 
results and that of the previous ex- 
periment that the standard experi- 
mental procedure for training original- 
ity produces some kind of learning, 


training 
as those 


A sig- 
as obtained on 
d the Unusual 


that the effects 
persist to a sig- 
at least two days, 
Xperimental condi- 


RELATION oF ORIGINALITY TRAINING 
TO LEARNING TO LEARN 
Originality 


training studies of the 
kind reviewed 


here and learning to 
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learn studies (Duncan, 1958; Bario 
1949; Morrisett & Hovland, 1959) 
have the common characteristic of 
producing relatively nonspecific trans- 
fer effects. Training on given stimuli 
transfers to other stimuli in the absence 
of any obvious opportunity for re 
Operation of primary stimulus general- 
ization. Despite the apparent similar- 
ity between the results of training i 
learning to learn and those i 
here, differences in the experimenta 
operations and the kind of behavior in- 
duced in these procedures suggest that 
different principles are involved in 
these situations. . 
The learning to learn procedure in- 
volves the learning of a great many 
different discrimination problems. 
Different stimuli are presented repeat- 
edly until S has learned a differential 
response to a criterion or has received 
a given number of trials. The re- 
Sponses are not varied and uncommon 
responses are not reinforced, nor can 
they be, because a Specified response 
is correct, Under these conditions Ss 
show a progressive decrease in the 
number of trials required to learn new 
discrimination problems, Tn original- 
ity training, on the other hand, vari- 
ability of response is induced, Re- 
peated presentation of the same stimuli 
and reinforcement of a specific pre- 
designated response likewise does not 
occur in originality training. In learn- 
ing to learn, the stimuli vary from 
problem to problem, but the ee 
do not. In originality training an 
test situations the stimuli and the re- 
Sponses vary, Thus the two proce- 
dures differ in terms of the operations 
employed to induce nonspecific trans- 
fer, Furthermore, the interpretations 
offered by Duncan (1958) in his a 
Prehensive study of learning sets a 
the human level do not appear appli- 
cable to originality training. The r 
quisition of observing responses, habits 
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of looking at details of stimuli, etc. 
are important in discrimination prob- 
lems but do not appear to be particu- 
larly relevant in originality training 
situations of the kind described here. 
Likewise, the analysis of learning to 
learn by Morrisett and Hovland 
(1959) in terms of the discrimination 
between stimulus trace patterns does 
not appear applicable to originality 
training. 


AN INTERPRETATION OF ORIGINALITY 
TRAINING 


Two basic problems, not unrelated, 
posed by the experiments on original- 
ity training by Maltzman and his asso- 
ciates (1958, 1960) demand an inter- 
pretation. First is the nature or 
source of the reinforcement for origi- 
nality. In all of the experiments, suc- 
cessive training trials in the standard 
experimental conditions are accom- 
panied by increases in the uncommon- 
Ness of the responses. The originality 
induced in this manner transfers to 
different stimulus materials, and these 
effects tend to persist. Originality 
training thus produces behavioral 
changes that are characteristic of learn- 
ing, yet they occur in the absence of 
differential reinforcement administered 
by E. Nevertheless, we believe that 
uncommon responses received differ- 
ential reinforcement, Reinforcement 
was obtained, because the occurrence 
of any previously established intra- 
verbal association is self-reinforcing. 
As Thorndike puts it: “The mere oc. 
currence of a connection strengthens 
it” (1949, p. 23). Thus, where there 
is an initial probability greater than 
zero that a given verbal stimulus will 
evoke a verbal response in an operant 
Situation such as word association, the 
Occurrence of that response will in- 
Crease the probability of its occurrence 
On a subsequent presentation of that 
Stimulus. Different initial conditions 
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may only increase or decrease the 
amount of reinforcement affecting the 
stimulus-response connection. In the 
type of experimental situation with 
which we are concerned here, the un- 
commonness of the response evoked by 
the stimulus is one of these conditions. 
In other words, the amount of rein- 
forcement is inversely related to the 
initial probability of a verbal stimulus 
evoking a verbal response. Originality 
is more reinforcing than commonplace 
responding. 

An impression gained from observ- 
ing Ss in the experimental situation is 
that repeated evocation of different re- 
sponses to the same stimuli becomes 
quite frustrating: Ss are disturbed by 
what quickly becomes a surprisingly 
difficult task. This disturbed behavior 
indicates that the procedure may not 
be trivial and does approximate a non- 
laboratory situation involving origi- 
nality or inventiveness, with its fre- 
quent concomitant frustration. ee 
impression of frustration accompany- 
ing the task of producing repeated un- 
common responses would lend support 
to the notion that when such responses 
do occur they are self-reinforcing to a 
considerable degree. These specula- 
tions, however, are not essential to the 
investigation of the initial hypothesis 
of the self-reinforcing character of un- 
common responses. However, these 
as well as other considerations suger 
that the distribution and scheduling pi 
the evocation of uncommon he! al 
may be of great importance in i 
mining the extent to which a s 
fer effect will be obtained. ; 

Tt should be noted, however, tte 
the role of self-reinforcement is pro - 
ably less significant im producing a 


increase in originality during training, 


the successive repetitions of the same 
word list, than in the test situation. 
Since instructions to the standard ex- 


perimental group prohibits the reoc- 


240 


currence of the same response to a 
given stimulus word, successive re- 
sponses to that word will be progres- 
sively lower in the related response 
hierarchy. This is sufficient to account 
for the increase in the uncommonness 
of responses with successive repetitions 
of the same stimulus words. How- 
ever, we believe that a second variable 
contributes to the increase in origi- 
nality during the training session, gen- 
eralization of the reinforcement effects 
from the evoked response to still more 
uncommon responses in the same 
hierarchy. However, the role of self- 
reinforcement becomes significant 
when the transfer of originality to the 
test situations is considered, In this 
case the carrying out by S of previ- 
ously administered instructions to give 
different responses will not account 
for the greater originality of the stand- 
ard experimental condition in the test 
situations. 
This leads to the second problem 
presented by the experimental data: 
the fact that the evocation of uncom- 
mon responses on one list of stimulus 
words facilitates the occurrence of 
original responses on a different. list 
of words, and even more perplexing, 
on the rather different Unusual Uses 
Test. The problem is to account for 
the fact that reinforcement of one in- 
stance of original behavior increases 
the response tendency for other in- 
stances of such behavior under differ- 
ent stimulus conditions. A relatively 
nonspecific kind of transfer of training 
is induced. Without it, the facilitation 
of originality would not be possible. 
We do not have any simple rigorous 
explanation of this phenomenon at the 
present time, but offer the following 
tentative suggestions. The intraverhal 
associations possessed by a normal 
educated adult are enormously exten- 
sive and complex. In all likelihood, 
almost every verbal response is associ- 
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ated to some extent with every other. 
Furthermore, the intraverbal associa- 
tions among common verbal responses 
are stronger than between common 
and uncommon responses. Likewise, 
the intraverbal associations among un- 
common responses are relatively 
stronger than between uncommon and 
common. The evocation and rein- 
forcement of uncommon responses 
will, through complex kinds of medi- 
ated generalization, therefore increase 
differentially the probability of occur- 
rence of other uncommon RESPONSES: 
even though they do not ordinarily 
occur to the same stimuli as the orig- 
inal reinforced responses. In a similar 
fashion, the evocation of common re- 
sponses would differentially increase 
the probability of occurrence of other 
common responses even though they 
may be members of different response 
hierarchies. An additional character- 
istic of the standard experimental pro- 
cedure used in originality training is 
that it provides for the inhibition of 
common responses as well as the fa- 
cilitation of uncommon responses. 
Through mediated generalization the 
effects of inhibition will produce a dec- 
rement in the excitatory potential of 
other common responses. It is this 
characteristic which may be responsible 
for the more general transfer effects 
obtained with the standard experi- 
mental procedure than with the proce- 
dures that induce uncommon textual 
responses or employ uncommon stimu- 
lus situations for evoking, in ae a 
common responses. None i ie 
other procedures would involve 
inhibition of common responses. — : 
It should be clear that the foregoing 
hypotheses are speculative and a 
connection with the data presented 
tenuous. Considerable careful gapen 
mentation tracing the intraverbal Ve 
Sociations present in the esperimen 
situation is needed as well as the ¢ 
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velopment of laws of verbal com- 
pounding before a completely adequate 
explanation of the basis for originality 
training can be given. Whether or not 
such explanations can be given, the 
method of training originality de- 
scribed in this paper seems worthy of 
further study in its own right, and 
potentially may have considerable prac- 
tical application. This again is a prob- 
lem requiring sustained experimental 
research employing different kinds of 
test materials, 

In conclusion, it should be empha- 
sized that little is to be gained by dis- 
puting the usage of the term “origi- 
nality” as it has been employed here, 
or whether the current experiments de- 
scribed are investigating the essence of 
originality, The important problem is 
the experimental one of determining 
the variables influencing the occur- 
rence of uncommon responses in rela- 
tively simple situations and the func- 
tional relationships that obtain in these 
Situations. Finally, the extent to 
which these effects hold for more com- 
plex behavioral situations must be de- 
termined. Such work is now in prog- 
ress in the UCLA laboratory. But to 
designate only highly complex be- 
havior of a given sort as original to 
the exclusion of simpler better con- 
trolled behaviors seems gratuitous, as 
well as an obstacle to the progress of 
research in this vital area. 


SuMMARY 


The basic problem in the training of 
originality is to devise a means of in- 
creasing the frequency of uncommon 
behavior. Once it takes place it may 
receive reinforcement and increase the 
probability that other original behavior 
will occur. Earlier attempts to devise 
training methods for originality were 
briefly mentioned, as well as related 
Studies of problem solving. 


A series of experiments by Maltz- 
man and his associates was reviewed, 
and a procedure which consistently 
facilitated originality was described. 
This procedure involves the repeated 
presentation of a list of stimulus words 
in a modified free association situation 
accompanied by instructions to give a 
different response to each stimulus. 
Under these conditions the responses 
become more uncommon. When pre- 
sented with new stimulus materials, Ss 
receiving such training are reliably 
more original than Ss receiving no 
training. Tentative suggestions as to 
the behavioral bases for the training 
effect were given. 
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PREPARATORY SET VARIABLES RELATED TO CLASSICAL 
CONDITIONING OF AUTONOMIC RESPONSES? 
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Experiments on conditioning with hu- 
man subjects (Ss) are complicated by 
the fact that response change or “learn- 
ing” can take place at different levels, 
sometimes facilitating and other times 
interferring with the experimenter’s 
(2s) goals. Razran (1955) empha- 
sized this point when he distinguished 
among five relations of conditioning and 
perception. Of his categories two are 
particularly relevant here, representing 
a distinction between “conditioning 
without perception” and “conditioning 
with perception,” the difference de- 
pending largely upon whether the rela- 
tion between the unconditioned reaction 
and the to-be-conditioned reaction is 
perceived by S. 

As a specific example consider stud- 
ies conditioning the GSR or change in 
heart rate with human Ss. Over twenty 
years ago Cook and Harris (1937) 
demonstrated one trial “verbal condi- 
tioning” of the GSR in which the 
strength of the response was not de- 
pendent upon the number of reinforce- 
ment trials. They concluded, not only 
that the GSR is verbally conditioned, 
but also that such “verbal conditioning 
is impossible to prevent in conditioning 
by the usual procedure.” In spite of 
this early result, most GSR “condition- 
ing” studies since that time appear to 
have been conducted either ignoring 
verbal-perceptual factors or assuming 
that they are controlled by instructions 
designed to mislead the Ss concerning 
the nature of the experiment. 
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When the “perceptual” problem has 
been attacked directly the favored ap- 
proach has been to manipulate ver- 
bal and attitudinal conditions through 
changes in instructions to S, informing 
him of stimulus relations or asking him 
to take a voluntary part in the process. 
Early studies of this type were summa- 
rized by Hilgard and Marquis (1940). 
Later studies have not emphasized 
autonomic responses, although Lacey 
and Smith (1954) and Lacey, Smith, 
and Green (1955) demonstrated heart 
rate conditioning differences between 
“aware” and “unaware” Ss; and Grings 
and Kimmel (1959) noted marked in- 
struction effects in GSR discrimination 
learning. References to the evanescent 
nature of GSR conditioned responses 
are common (Grant & Schiller, 1953, 
Grings & Shmeley, 1959), and the im- 
plication is usually made that the GSR 
is sensitive to transient changes in hu- 
man Ss, 

The above is ample to suggest that 
the handling of verbal perceptual set 
factors in human autonomic condition- 
ing remains a problem of major impor- 
tance. Data available are limited to 
demonstrations of the existence of per- 
ceptual factors, with very few attempts 
to define operations for their detection, 
evaluation, and control. Extension of 
understanding of the conditioning proc- 
ess with human Ss may result from 
exploration of new operations for in- 
ferring and manipulating perceptual 
sets in situations resembling classical 
conditioning. 
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THE PERCEPTUAL Disparity 
RESPONSE 


The present research introduces an 
Operation for inferring the existence of 
a perceptual set in conditioning studies, 
one that is not based on verbal instruc- 
tions to the S. The concept is based on 
the premise that human Ss, in situa- 
tions with conditioned stimuli preced- 
ing noxious stimulation, use the cue as 
a preparatory signal—that is, the re- 
sponse to the conditioned stimulus (or 
what is, in effect, the conditioned re- 
sponse) is partly a preparatory set for 
receipt of the uncondition 
stimulus, 

It is proposed that the 
perceptual set aspect of conditioned re- 
sponses can be estimated from per- 
ceptual disparity situations, 


ed noxious 


Strength of a 


or from 
situations in which a cue stimulus jis 
followed by an unconditioned stimulus 


different from that which had been as- 
Sociated with that cue previously, 
other words, it is assumed that there 
will be a difference in magnitude of 
Tesponse between situations where re- 
ceipt of stimulation is in accord with 
past experience and where such receipt 
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of stimulation is not in accord with past 
experience, It is further assumed that 
a perceptual disparity variable, so de- 
fined, will be related to meaningful de- 
termining conditions in human behavior 
and lead to explanatory possibilities 
which go beyond more conventional 
definitions of conditioned responses, — 
To test the feasibility of such a dis- 
parity variable, several experiments 
have been conducted and will be de- 
scribed briefly below along with later 
studies amplifying upon the concept, 
In one experiment, three groups of siis 
Ss each were taught that when they 
moved a lever in one direction they 
would receive a tone stimulus and when 
they moved the same lever in the op- 
posite direction they would receive a 
bright light, The groups were given 
different numbers of training trials 
(four, six, and eight) during which the 
above regularities applied. Then they 
were administered a test series contain- 
ing both disparity and nondisparity 
trials. Disparity was achieved by fol- 
lowing with a ton 
movement whic! 
followed by ligt 


e the direction of 
h previously had been 
nt, or by following with 
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a light the direction of movement which 
had previously been followed by a tone. 

The results from the test series are 
plotted in Figure 1 where it can be seen 
that the magnitude of response on dis- 
parity trials was larger than the re- 
sponse on the nondisparity trials for 
all groups and the difference between 
the curves increases with larger num- 
bers of training trials. The difference 
between the disparity and nondisparity 
responses tested overall by a ¢ test was 
significant at the 1% level: the effect 
of the number of reinforcements var- 
iable tested as an analysis of variance 
of the subgroup mean differences be- 
tween disparity and nondisparity re- 
sponses was not significant. 

A second, similar study (Grings & 
Kimmel, 1958) was conducted using 
tone and shock stimuli rather than tone 
and light as UCSs. Sixty Ss were ob- 
served, 30 being given 4 reinforcements 
hefore test and 30 being given 10 rein- 
forcements before test. Again the test 
series contained both disparity and non- 
disparity trials. The results, shown in 
Figure 2, agree with those in the first 
study, i.e., for both groups the response 
(GSR) on disparity trials is larger 
than on nondisparity trials (significant 
at the 1% level) and the size of the 
difference between the disparity and 
nondisparity responses increases as a 
function of the number of reinforce- 
ments. 

On the basis of these and related 
studies it was felt that the assumption 
of a disparity variable was reasonable, 
and attention was turned to the manip- 
ulation of the variable by introducing 
appropriate independent variation to 
which it might be related. 


DETERMINERS OF THE PERCEPTUAL 
Disparity RESPONSE 
On the basis of the already cited 


Studies it was hypothesized that at least 
two major classes of determiners of the 


or AUTONOMIC RESPONSES 245 
disparity response exist. One involves 
the strength and stability of the antic- 
ipatory response, which in turn appears 
to be a function of the number of rein- 
forcements of the cue-consequent rela- 
tions. If preparatory set responses in- 
crease and stabilize as most habits do. 
the more frequent the sequence cue- 
reinforcing stimulus the more strong 
and consistent should he the organism's 
preparation for the reinforcing stim- 
ulus, other variables held constant. 
Elaboration of this point, however. 
needs to be made, for the operation 
of competing processes, like response 
adaptation, is clearly evident in Figures 
1 and 2. Further discussion of this 
matter is made later. In addition, nu- 
merous factors related to the condi- 
tioned stimulus used (such as its in- 
tensity, discriminability, etc.) probably 
determine the strength of the condi- 
tioned preparatory response. i 

The second major class of determiner 
of the magnitude of the disparity re- 
sponse would include the direction of 
the disparity—i.e., whether actual stim- 
ulation was stronger or weaker than 
that for which S was prepared—and the 
amount of disparity measured on some 
scale of stimulus distance from antic- 
ipated to actual stimulation. 

Several studies have been conducted 
to explore the hypotheses outlined 
above, two examples of which are given 
below. One of these has been reported 
by Kimmel (1960). He was testing 
the relation between amount of classical 
conditioning of the GSR and the in- 
tensity of the CS. Tone CSs of inten- 
sities of 35, 55, 75, 95, and 115 dh, re: 
0002 dyne/em? were used with an 
electric shock UCS. In attempting to 
control differences in response strength 
to cue stimuli during an extinction 
series he subdivided each acquisition 
CS intensity group into three extinc- 
tion intensity subgroups (45, 75, and 
105 db.). Since at the time of the first 
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Fic. 3. Magnitude of perceptual disparity 
response as a function of amount and direc- 


tion of stimulus change. (Reproduced from 
Kimmel, 1960.) 


extinction trial S had just been given 
20 trials during which the first stimulus 
he received on each trial was the acqui- 
sition intensity, it may be assumed that 
between trials he remained prepared 
for that intensity. When, therefore, he 
received an intensity different from the 
previously experienced one, he might 
respond to the disparity. Kimmel’s 
data are presented in Figure 3 in 
terms of GSRs resulting from different 
amounts and directions of change. The 
effects of both direction and amount of 
change were reported as Statistically 
significant, 

Similar results were obtained in a 
study by Grings, Dossett, and Honnard 
(1959), Forty-eight college students 
were randomly separated into two ex- 
perimental groups, One group was 
given conditioning training with 100% 
reinforcement, the other with 67% re- 
inforcement. Two cue stimuli ( colored 
light CSs) were followed by a 50-db, 
tone or a tone of different intensity, 
Four subgroups of 6 Ss each were 
trained to different second stimuli of 
either 30, 70, 90, or 110 db. re: 0002 
dyne/cm?. A standard number of 
eight reinforcements of each cue-tone 
combination was followed by ʻa dis- 
parity trial. . Since each disparity trial 
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was made up of the cue to the 50-db. 
stimulus followed by the actual “other 
stimulus, there resulted four terea 
degrees of discrepancy: —20 db.. 20 
db., 40 db., and 60 db. 

The results of this experiment are 
presented in Figure 4 where mean re- 
sponse to the UCR component of the 
disparity trial (adjusted for reference 
level of the UCS involved) is given as 
a function of amount and direction of 
disparity. When an analysis of ‘var- 
iance was made on the data shown, the 
difference between the 100% and 67% 
groups was found to be significant Ber 
yond the 5% level (F = 6.80, df = 2 
and 6). The difference between the 
disparity subgroups, as based on the 
values portrayed by the combined 
groups curve in the figure, fell just 
short of the 5% level of significance 
(F = 2.25, df = 4 and 6), 

On the basis of the above studies, it 
is tentatively concluded that the greater 
the amount of discrepancy between 
anticipated and actual stimulation the 
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Fic. 4. Mean responses to the UCR com- 
ponent of the disparity trial, adjusted for 
reference level of the UCS involved, for 
Stoups differing in the percentage of priot 
trials reinforced. 
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Fic. 5. Comparison of conditioned heart 
rate responses for groups differing in aware- 
ness of CS-UCS relations. (Adapted from 
Lacey, Smith, & Green, 1955.) 


greater the GSR elicited by the actual 
stimulus, Discrepancies in an upward 
direction appear to yield larger dis- 
parity responses than discrepancies in 
a downward direction. In addition, 
these factors appear to be a function of 
the number of reinforcements and the 
percentage of reinforcements employed. 


APPLICATIONS OF THE DISPARITY 
CONCEPT 


Thus far the emphasis has been upon 
the definition of the disparity response 
and the exploration of some of its de- 
termining conditions. Next, attention 
is paid to application of the disparity 
situation in attempts to clarify some 
problems of human conditioning. 

One of the problems that has con- 
cerned the writer is the nature of the 
GSR acquisition “learning” curve. In 
other words, what is the effect of the 
number of trials (or reinforcements) 
on the magnitude of a conditioned au- 
tonomic response? If the conditioned 
GSR, for example, is determined by 
Interactions of different processes (i.e., 
Nonperceptual and perceptual), it would 
be predicted that at times one process 
Or the other might be dominant. One 


result could be that a “typical” learning 


curve would not be obtained: that is, 
GSR magnitudes as a function of num- 
ber of reinforcements would not be an 
increase in CR magnitude with an in- 
crease in trials. 

That different “learning” curves 
might be expected from different per- 
ceptual situations has been demon- 
strated by Lacey, Smith, and Green 
(1955). It will be noted from Figure 
5 that they found “unaware” Ss gave 
increasing heart rate CRs with more 
trials, whereas “aware” Ss showed de- 
creasing CRs with more trials. Sim- 
ilarly, Kimmel (1959), with GSR con- 
ditioning in this laboratory, found 
magnitude of anticipatory CRs to in- 
crease up to eight reinforcements, then 
to decrease. His data are given in 
Figure 6. The implication in both 
cases is not that less “learning” is 
occurring with more reinforcements but 
rather that the perceptual aspects of the 
situation are probably changing. 

If it might be presumed that per- 
ceptual set factors are stabilized by re- 
peated trials where a given CS-UCS 
relation obtains, it might be proposed 
that as long as no irregularities occur 
in the training situation S will cor- 
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rectly “set” for the noxious UCS and 
therefore will show a curve of adapta- 
tion with successive stimulation (just 
as he would to a series of presentations 
of the UCS alone). The result will be 
that anticipatory CRs and UCRs may 
both show reduced amplitude with in- 
creased trials. 3 

It might be argued further that it is 
the perceptual tendencies or sets which 
are becoming stronger with increased 
experience or reinforcements. If the 
disparity response is taken as evidence 
of perceptual set strength, it can be 
predicted from data already at hand 

(Figures 1 and 2) that the disparity 
variable (the difference between the 
response magnitudes on disparity and 
nondisparity trials) does increase with 
larger numbers of reinforcements. It 
was also found (Figure 4) that the dis- 
parity variable was greater for 100% 
than for 67% reinforcement conditions, 

The proposal here, then, is that the 

disparity response may show a differ- 
ent course of development (with regard 
to the number-of-reinforcements var- 
iable) than will a simple anticipatory 
CR. Depending upon conditions it 
may provide a “better” estimate of 
strength of preparatory set than the 
anticipatory CR does. Tn order to 
examine this possibility, consider a 
current experiment in which an attempt 
was made to émphasize the perceptual 
set properties of the conditioning envi- 
ronment, 

Taking a lead from 
Welch, and Fisichelli 
periment was designed to test the no- 
tion that reinforcing a class of cues 
intrinsically suggesting stimulation wil] 
lead to increased perceptual response 
tendency (to expect stimulation) on a 
cue not reinforced but capable of sug- 
gesting stimulation, The hypothesis 
was tested by presenting a series of 
Seven-word stimuli which were appro- 
priately reinforced during the acquisi- 
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tion period, i.e., the words “soft music 
were followed by a few bars of soft 
orchestral music, the words cool 
breeze” were followed by a brief stim- 
ulation by an electric fan, “loud tone” 
followed by actual tone, “green light 
by actual green light, etc. After either 
14 or 49 such “reinforcements’ a crit- 
ical word or neutral word was intro- 
duced in a test series which also in- 
cluded a disparity trial (cue “green 
light” followed by the actual loud 
tone). 

Since in all cases the cue was pre- 
sented for 5 seconds before the rein- 
forcing (UCS) stimulus was presented, 
it became possible to trace the course 
of anticipatory responses following any 
Particular cue. Also, since long and 
short reinforcement series were em- 
Ployed, the effect of different numbers 
of reinforcements on both the regular 
and the disparity trials could be ob- 
served. The previous development 
would lead to the prediction that antic- 
ipatory Tesponding (to the cue “loud 
tone”) would become less with more 
reinforcements, as would also the 
response to the actual tone, due to 
adaptation. On the other hand, the 
disparity response to tone following an- 
other (“green light”) cue would in- 
crease with greater numbers of rein- 
forcements, 
The results for this comparison for 
20 Ss receiving 14 reinforcements prior 
to the disparity trial and 20 Ss receiv- 
ing 49 reinforcements prior to the dis- 
Parity trial are shown in Figures 7 
and 8. The data follow the prediction 
closely, the anticipatory responses show- 
ing significant decrease from short to 
long reinforcement series (t = 2.64, df 
= 38) and the disparity responses 
showing a significant increase over the 
Same period (t = 2.23, df = 38)2 

* Essentially the same relations were ob- 
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Explanation of the above results has 
proceeded from familiar behavioral pos- 
tulates concerning reinforcement and 
perceptual sets. The concept of rein- 
forcement designated an empirical state 
of affairs with specified temporal reg- 
ularities obtaining between classes of 
stimuli (CSs followed by UCSs). 
Similarly, the perceptual set concept 
was related to such general notions 
as orientative-investigative reactions of 
Pavlov (1927), anticipatory goal re- 
sponses of Hull (1930), and the S-R 
form of Tolman’s expectancy (Mac- 
Corquodale & Meehl, 1954). It was 
later pointed out to the writer that 
experimentation on disparity variables 
using different descriptive terms had 
been carried out in the laboratory of 
Anokhin in Russia, This led to fur- 
ther considerations, as follows. 

Anokhin (1955) was interested in a 
general theory of adaptive behavior 
which would help to integrate the ac- 
tivities of physiologists and psychol- 
ogists. One of his central notions is 
that of “reverse afferentation,” a proc- 
ess of reverse signalization or feedback 
which follows each reflex act “like an 
echo” and which informs the organism 
whether the adaptive effect appropriate 
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NUMBER OF REINFORCEMENTS 
Fic. 7. Anticipatory CRs (magnitude of 


GSR during delay interval) as a function of 
the number of reinforced trials. 
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Fic. 8. Magnitudes of disparity response 
(difference between responses to the UCS on 
disparity and nondisparity trials) as a func- 
tion of the number of reinforced trials. 


to the situation has been achieved or 
not. This afferent process provides the 
necessary conditions to the organism’s 
stopping activity or passing on to other 
acts comprising the chain of adaptation. 
The notion of “reinforcement” can be 
applied to any fractional stage of adap- 
tation which has obtained correspond- 
ing reverse afferentation. He talks 
about “sanctioning afferentation” which 
stops further attempts at organization 
of new reflex acts and strengthens the 
last combination of excitation in the 
centers of the brain which gave a suc- 
cessful adaptive effect at the periphery. 

On the question of “How does the 
organism tell the sanctioning properties 
of the afferentation?” Anokhin brings 
in the concept of “intention.” To be 
effective reverse afferentation must cor- 
respond to our intention, that is, it must 
correspond to some provisional com- 
plex of excitations which arose before 
the reflex itself was formed. At the 
time of the arrival of the reverse af- 
ferent impulses from the UCS to the 
cortex the behavior will be stable only 
to the extent that the UCS excitation 
corresponds accurately to the prepara- 
tory afferent excitation which was 
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TABLE 1 
Comparison or GSR RECOVERY QUOTIENTS 


TO VARIOUS UCSs BEFORE AND AFTER 
DISPARITY EXPERIENCE (N = 18) 


Nondisparity Disparity 
UCS UCS 
Before Disparity . 
Experience A2 63 
After Disparity - 
Experience 51 63 
t ratio 2.14 
P -05 


brought to life b 
onds before the 
another way, the 
produces a conclusi 
adequacy or inade 
forcement which 
The Physiological b 
is conceptualized 
ceptor” apparatus, 
An experiment cited by Anokhin em- 
phasizes the closeness of his work to 
that being Teported from the USC lab- 


oratory. Two Secretory CRs were es- 
tablished, one a tone “la” 


forcement with tye rusk 
side of the bench and tl 
“fa” with reinforcement with rye rusks 
on the left side, During the acquisition 
phase no erroneous motor reactions 


l went right after 
“la” and left after “ a.” en, on a 


test day dry meat was Placed with the 
rusk on the left side. The result was a 
brief investigative reaction and rapid 
ingestion of the meat, after which the 
Previous motor patterns becaine dis- 
rupted. Regardless of which CS was 
given the animal went to the left, sug- 
gesting that the eating of the meat Jeq 
to a dominance of the reaction to the 
left. Further observation with Tusk- 
only reinforcements showed refusal to 
Cat. the rusk, and masked enges $ 
‘the anima?’s beh 


y the CS several sec- 
reinforcement, 
afferent apparatus 
ve evaluation of the 
quacy of the rein- 
follows the signal, 
asis for this Process 
aS a cortical “ac 


À D Y 
Avior, suggesting What t 
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Anokhin terms 
state.” 


The emphasis in Anokhin’s expel 
ment would appear to be on the = 
tive effect of disparity situations pan, 
than the basic relation between ae 
reinforcement concepts and the concep 
of sanctioning reverse aferenta aa 
However, if one considers the nila 
closely, there are numerous aga 
for extending the perceptual asa 
research described in this paper al 
least one example will be sonsido na 
“rom Anokhin’s experiment one a 
assume that disparity situations pro 
duce marked change in the cauces 
the “reinforcement” process. In io 
havioral terms the occurrence of a adis 
parity trial would alert the organism, 
lead him to expect a change in rein 
forcement relations. Tt could be argue 
that even the absence of the UCS, as a 
an acquisition test trial, may ee 
a disparity-like Situation, The rer 
would be some effect upon the acquis! 
tion Process, 

Ossibilitie 
explored by 


a “clear-cut neurotic 


S like the above can o 
further analyzing a 
from experiments already describes" 
For example, in the study discusses 
Connection with Figure 1, the dispar! A 
test situation contained two series, a" 
with disparity and nondisparity EE 
If the occurrence of one disparity To 
has a disruptive effect, there should n 
a difference jn responses to UCSs a 
nondisparity trials after a disparity “a 
Perience and responses to the sam 
Ss prior to disparity experience. e 
response variable which shows this pe 
of disruptive effect readily is the 7 
very quotient first introduced ion 
Seeman. (1948). Tt is an expressit 
the Percentage return of the st 
Conductance to its original level on 
curring within a standard latency inter 


val. Tg rih ve io GS 
Wren f w Miis Syehavior “a ald 

"pled, the teturn to base level $5 are 
© slower than when the tria 
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proceeding smoothly. Expressed in 
terms of this experimental situation one 
would predict a slower recovery from 
response to the usual UCS after a dis- 
parity experience than before. That this 
is the case can be seen from Table 1. 

A similar effect might be expected 
with magnitude of response measures, 
that is, a larger GSR would be pre- 
dicted to a given UCS after a disparity 
trial than before. In the experiment 
referred to above this did occur, re- 
sponse magnitude changing from 1.4 
micromhos before the disparity trial to 
1.7 micromhos after. A related predic- 
tion that could be tested readily from 
numerous published studies is that, if 
an interspersed acquisition test trial has 
disparity properties (as it is proposed 
here it may have), the response to the 
first UCS after the test trial should be 
different from the response to the last 
UCS prior to the test trial. With GSR 
the magnitudes should be as indicated, 
i.e., larger UCR after disparity. 

These later considerations were in- 
troduced only to suggest some of the 
extended applications the disparity con- 
cept may have for explaining certain 
classical conditioning results. Other re- 
lations are being studied currently in 
order to evaluate the utility of the con- 
cept. These extend from attempts to 
Incorporate disparity notions in a ra- 
tional model of GSR behavior (Dossett 
& Grings, 1959) to the use of the per- 
ceptual disparity response as a measure 
of individual differences, 


SUMMARY 


A set of operations for inferring the 
existence of a preparatory set variable 
in human autonomic conditioning has 
been introduced and described. Termed 
a perceptual disparity response it is de- 
fined as the difference in magnitude of 
response between situations where re- 
ceipt of stimulation following a signal 


cue is in accord with past experience 
and where receipt of stimulation is not 
in accord with past experience with the 
particular cue. Experimental data dem- 
onstrating the disparity variable are 
presented. 

The assumption that the perceptual 
disparity response would be related to 
meaningful determining conditions in 
human behavior was evaluated. It was 
found that the disparity response mag- 
nitude was a function of the number 
and proportion of reinforcement trials 
and of the amount and direction of the 
stimulus change involved. 

In an attempt to extend the applica- 
tion of the disparity concept in human 
conditioning, two further situations 
were discussed. One concerned the re- 
lation between magnitude of disparity 
response as a function of number of 
training trials, and compared this to the 
common learning function relating an- 
ticipatory CRs and number of training 
trials. The second situation evaluated 
the disruptive effect of a disparity ex- 
perience in a sequence of regular rein- 
forcement trials. 

Brief reference was made to similar 
concepts and experimental work being 
carried out at the present time in 
Russia. 
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THE PLEASURES OF SENSATION* 
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That the senses are the channels of 
communication between the external 
environment and the nervous system 
is so much a truism that, except for 
those who espouse ESP, the senses are 
held to be the sole mediators of trans- 
actions between the organism and its 
environment. But students of the sen- 
ses tend to emphasize primarily the 
nature of their information handling 
function, their discriminative capacities 
and limits as well as the physiological 
processes upon which they are based. 
Behavior theorists have given heed to 
such function in another context when 
they speak of the cue function of stim- 
uli. According to these conceptions 
discriminative stimuli serve to steer be- 
havior, to guide it or set the stage for 
the response. But in this paper I do 
not plan to elaborate upon such proc- 
esses, Rather I wish to call attention 
to certain other aspects of sensory func- 
tion. 

In recent years, neurophysiologists 
have emphasized that sensory systems 
may mediate important functions other 
than discrimination. Attention has been 
focused on such generalized effects as 
arousal or alerting of the nervous sys- 
tem as a whole. Bremer (1935) first 
showed that a state of cerebral somno- 
lence could be produced in experimental 
animals in which the cerebrum had been 
surgically isolated from the brainstem 
and its incoming afferent pathways. He 
attributed this to the interruption of 
sensory influx itself. Later analysis 
(Magoun, 1958) showed that the som- 


1 Slightly revised version of Presidential 
Address presented at the Annual Meeting of 
the Eastern Psychological Association, At- 
lantic City, April 13, 1959. 


nolence resulted largely from interfer- 
ence with the centripetal influence ex- 
erted by the reticular system, the 
ascending reticular activating or arousal 
system. Much subsequent research has 
shown that this system controls not 
only the state of arousal of the “higher 
centers” of the nervous system, but that 
it can also modulate and influence ac- 
tivity within the classical sensory path- 
ways themselves (Galambos, 1956; 
Hagbarth & Kerr, 1954; Hernandez- 
Peon, 1955). The reticular system 
may also be significant for a variety of 
other psychological processes (Linds- 
ley, 1951; Schlosberg, 1954). 

Most sense organs were found to 
have at least two central neural path- 
ways. One, the well-known primary 
projection pathway via the thalamic 
sensory relays to the cerebral cortex, 
and the second, a nonspecific pathway 
by way of the reticular activating sys- 
tem with diffuse projections to the 
cortex and other neural structures. The 
reticular system is multimodal and in- 
tramodal, for it receives inputs from 
many modalities. The classical projec- 
tion systems are specific to one modality 
and may be said to mediate primarily 
the cognitive or discriminatory sensory 
functions, the nonspecific system to 
mediate physiological and behavioral 
arousal (Magoun, 1958). 

Psychologists especially have empha- 
sized that stimuli may have still other 
functions, especially that of reinforce- 
ment (Hebb, 1958b; Keller & Schoen- 
feld, 1950; Skinner, 1953; Spence, 
1956). This concept has most often 
been discussed in purely behavioral 
terms until the more recent studies 
of direct reinforcement by intracranial 
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stimulation (Brady, 1958; Miller, 1957; 
Olds, 1958; Olds & Milner, 1954). In 
the ensuing discussion I shall limit my- 
self to the so-called primary reinforce- 
ment for there is almost no limit to the 
range of previously neutral stimuli that, 
by one method or another, can be made 
to acquire reinforcing properties, That 
primary aversive reinforcement has been 
a function of stimuli has long been 
known, particularly for those stimuli of 
high intensity which elicit defense re- 
actions-or “reflexes.” Certain of these 
seem qualitatively more prepotent as, 
for example, the pain of electric shock; 
“rat runners” have frequently used 
shock as a primary negative reinforcer 
in their mazes or lever boxes. Some 
theorists hold that the pain reduction 
by shock termination provides the pro- 
totype of all reinforcement in the form 
of drive or tension reduction (Miller, 
1959). The study of stimuli as posi- 
tive primary reinforcers has, until re- 
cently, been restricted to those stimuli 
which are naturally related to such bio- 
logically functional activities as eating, 
drinking, or sexual activity. These 
stimuli could often be assigned a role 
in processes that mediated the satisfac- 
tion of a need or “homeostatic drive,” 
Thus they were said to be related in 
some way to the primary biological 
drives. But there has been increasing 
evidence of late that sensory stimula- 
tion, divorced from its need or drive 
reducing concomitants, may function as 
a reinforcer in its own right. “Extero- 
ceptive motivation” sometimes called by 
such names as curiosity or stimulus 
change (Harlow, 1953; Hebb, 1958a ; 
Kling, Horowitz, & Delhagen, 1956; 
Montgomery, 1952) has been demon- 
strated in a number of situations, And 
one theorist (Hebb, 1958a, pp. 451- 
467, 1958b) has attempted to link such 
reinforcement to the reticular activating 
System on the grounds that changes in 
level of activation are reinforcing, per 
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se, depending upon the prevailing level 
at the time of stimulation. Thus, in 
sensory isolation experiments, “the sub- 
ject experienced great swings of moti- 
vation, which alternated between pe- 
tiods of apathy and intense desire to get 
back to a normal environment.” Hebb 
concludes, “clearly man’s motivation is 
a function of his exteroceptive stimula- 
tion.” Thus, although the reinforcing 
function of stimuli has long been recog- 
nized and is the stock in trade of many 
experimentalists, there is still consider- 
able debate as to the basis for this 
effect. Does it depend upon some sec- 
ondary effect as need reduction, drive 
reduction, or change in arousal patterns 
in the reticular activating system? Or 
can we attribute these effects to sensory 
stimulation directly ? 

In this paper I shall discuss the prop- 
Osition that Sensory stimulation per se 
together with its ensuing central neural 
events be considered as a prime deter- 
minant of reinforcement, 


In this con- 
text I shall discuss our experiments on 


the sense of taste with emphasis on the 
Sensory physiology of taste mediated 
Motivation as revealed in the correla- 
tion between gustatory nerve discharges 
and food preferences in animals. This 
will lead into a consideration of the re- 
lation between hedonic processes and 
afferent nerve discharges, preference 
behavior, and taste reinforcement, Fi- 
nally the relation between affective sen- 
Sory processes and discriminative func- 
tions will be discussed in the light of 
Some speculations on the physiological 
bases for these two aspects of sensory 


function and their significance in be- 
havior, 


Taste as A MODEL SYSTEM 


Although I began my studies of the 
sense of taste with a traditional sensory 
emphasis, it became increasingly ap- 
parent that the gustatory sense has cer- 
tain unique features for the further 
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A block diagram of the recording apparatus showing two types of record. The 


upper trace shows a typical asynchronous, multifiber discharge from a large number of nerve 
fibers; the lower trace shows how such activity appears when processed through the 
integrator. (Reproduced from the American Journal of Clinical Nutrition.) 


understanding of behavior in general. 
Not only does the sense of taste possess 
selective receptor sensitivity that per- 
mits the discrimination of different 
taste stimuli, but these stimuli will elicit 
a number of specific consummatory re- 
sponses: drinking and eating or, on the 
other hand, rejection. It is a relatively 
simple matter to demonstrate the con- 
trol of behavior that taste stimuli can 
manifest by giving the animal a choice 
of water and taste solutions as Richter 
(1942) showed in his classic studies of 
self-selection. In addition, these stim- 
uli can act as powerful reinforcers of 
instrumental responses leading to inges- 
tion. Thus one and the same sense 
modality possesses easily demonstrable 
discriminative and reinforcing func- 
tions. In this respect it is an ideal 
model sensory-behavioral system in 
which to examine the relation between 
sensory stimulation and the reinforcing 
mechanisms in behavior as well as to 
Permit the manipulation of the state 
of the organism by certain deprivation 


operations. I should, therefore, like to 
review briefly some of the methods we 
have employed and some of our experi- 
ments on taste,? to show how we look 
at the sensory continuum and its af- 
ferent neural input to the CNS which 
both motivates and directs behavior. 
In so doing I shall make use of the 
behavioral data of other workers, as 
well as the results of our own phys- 
iological and behavioral experiments. 
Our behavioral studies to date have 
utilized primarily the preferential inges- 
tion method. 


The Gustatory Afferent Discharge 
and Preference 


Afferent nerves, like all nerves of the 
body, carry a series of electrical pulses 
which are the signs of impulse traffic 
up the sensory nerve. These can be 

2 These experiments have been supported 
in part by projects and grants from the 
Office of Naval Research, National Science 


Foundation, and the General Foods Corpo- 
ration, g 
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Fic. 2. Curves of taste responses in the 
cat to four different taste stimuli as indicated 
by the integrated response method. (Repro- 
duced from the Journal of Neurophysiology.) 


recorded by appropriate electronic de- 
vices as shown in Figure 1 (cf. Pfaff- 
mann, 1959). I shall present only data 
for the over-all activity in the taste 
nerve as afforded by the integrating 
circuit illustrated in the lower trace 
in Figure 1. The gustatory response 
curves for the cat are shown in Figure 
2 in terms of the magnitude of the elec- 
trical signal generated in the nerve 
when each of the basic taste stimuli is 
applied to the tongue receptor surface. 
By such curves we can map out the 
taste sensitivity of different animals 
(Pfaffmann, 1955). These curves do 
not tell us about “qualitative” differ- 
ences in the nerve response, For that, 
analysis of the activity of single nerve 
fibers in the nerve js required (Pfaff- 
mann, 1959). 

From the point of view o 
the basic response to taste 
either one of acceptance or rejection so 
that the classical manifold of four tastes, 
salt, sour, bitter, and Sweet, may be 
reduced to two behavioral classes: ac- 
ceptance and rejection. We have used 
a typical two-bottle Richter-type pref- 
erence situation to study such behavior 


in different species (Bare, 1949; Car- 


f behavior, 
Solutions is 
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penter, 1956; Pfaffmann, 1957). Cer- 
tain substances may be accepted at low, 
but rejected at the higher concentra- 
tions. This is so for NaCl in the rat 
where the well-known preference-aver- 
sion response can be demonstrated to 
be a function of concentration as shown 
in Figure 3. The solid line shows 
relative preference behavior 


ce, intake taste sol. 
ce, intake taste + cc. intake H,O 


Most acceptable solutions including 
sugar will follow this pattern when 
ingestion measures are used. Figure 3 
also shows the neural response in the 
rat chorda tympani nerve to the same 
salt solutions. The broken line shows 
the integrated neural responses as a 
percentage of the maximum response 
observed. Clearly the preference be- 
havior (i.e. Tesponse greater than 
50%) is not apparent until some sub- 
stantial neural input has been achieved 
(20%-30% of maximal discharge). 
The peak preference occurs at about 
60% of maximal input, and the aver- 
sion rapidly sets in while the afferent 
discharge is still climbing. There is no 


change in the afferent input signal that 
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Fic. 3. Preference response to different 
salt concentrations and magnitude of the 
chorda tympani nerve discharge in the rat- 
Ordinate to the left corresponds to prefer- 
ence response, ordinate to the right shows 
magnitude of integrator deflection. (Repro 
duced from Pfaffmann, 1958.) 
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corresponds to the behavioral inversion 
point—at least we have not been able to 
detect any such change. Up to the in- 
version point, the behavior parallels the 
afferent discharge curve as if the posi- 
tive approach to the stimulus were due 
directly to the afferent input. The in- 
version and the subsequent aversion 
suggest the intrusion of a secondary 
“stop” system, The fact that the in- 
version point is close to the isotonic 
point for sodium chloride solutions 
(.9% NaCl [.15M] is isosmotic with 
the body fluids) suggests that some 
postingestion or metabolic factor re- 
lated to water balance has become 
operative. 

Indeed Stellar, Hyman, and Samet 
(1954) have shown that the salinity of 
the gastric contents does affect the salt 
preference. Strong salt solutions in- 
tubated directly into the stomach de- 
press the drinking of the more concen- 
trated salt solutions. On the other 
hand, this is not the only stop factor 
for, in the same study, animals with 
an esophageal fistula showed the pref- 
erence for hypotonic solutions and aver- 
sions for hypertonic ones even though 
the solutions never reached the stomach. 
Taste factors alone appear capable of 
eliciting the typical salt preference- 
aversion function. 
indicate a change in “sign” of the af- 
ferent input merely as a consequence 
of its increasing intensity. Perhaps 
there is a central-neural switching when 
the intensity of the afferent salt dis- 
charge reaches a critical value. Most 
intense sensory stimuli have aversive 
effects. z 

The relation of behavior to sensory 
afferent discharge for some of the other 
basic taste stimuli is shown in Figure 4. 
The upper figure shows the preference 
response curves for sugar, acid, and 
quinine as well as salt. We show here 
only the intakes above 50% for salt and 
sugar, remembering that at higher con- 
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Fic. 4. Composite graph of behavioral 
and electrophysiological responses in the rat. 
Upper graphs show the percentage preference 
(or aversion) as a function of stimulus con- 
centration, The integrator responses in the 
chorda tympani nerve to the same stimuli are 
shown in arbitrary units in the lower graph. 


centrations, the curves fall below the 
50% value. The responses for acid 
and quinine are both aversions; no 
preference is shown. Note in the lower 
curves that the absolute magnitudes of 
neural response are quite different for 
the electrolytes as compared with the 
nonelectrolytes. The behavioral re- 
sponse to quinine is quite definite and 
appears before there is a clear signal 
“greater than the noise” for the nerve 
response. Here the behavioral indi- 
cator is the more sensitive. In an 
analogous way, the response to sucrose 
behaviorally is clear and definite, yet 
the neural response is disappointingly 
small. Intrinsic differences in the mag- 
nitude of the-neural response, due in 
part to fiber size, etc., mean that sheer 
size of electrical signal itself cannot be 
correlated with behavioral effect. On 
the other hand, the range of effective 
stimulus concentration values in the 
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two sets of determinations for each 
stimulus shows good agreement. The 
exact form or position of the preference 
aversion curves along the abscissa may 
be shifted depending upon such factors 
as order of presentation of stimuli, de- 
gree of hunger or physiological need, 
and amount of experience with the taste 
stimuli. Itis the general nature of the 
relations between sensory input and 
preference that is shown in Figure 4, 
The behavioral response to sugar 
solutions has been studied rather exten- 
sively, and it now seems possible to give 
an account of the start and stop factors 
in relation to sensory input. The pref- 
erence curve just shown is based on 
ingestion of sugar over a 24-hour or 
longer period. Ingestion is a rising 
function of concentration to a peak pref- 
erence when the typical inflection oc- 
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toG % conc. 
Fic. 5. Composite graph 
neural response to sucrose and t 
measures obtained by three di 
dures in the rat. Pref. is the typical two- 
bottle 24-hour ingestion preference method, 
Both the sucrose and water intakes are 
shown as smoothed curves, based on data by 
Richter and Campbell. RR is the rate of bar 
pressing during a test period for a small dro 
of sucrose on a regular reinforcement sched- 
ule, and PR is the same on a periodic rein. 
forcement schedule, both based on Guttman 


(1953), Ordinates have been adjusted to 
make the maximum values in the origina} 
curves equal to 100, 


showing the 
he behavioral 
fferent proce- 


curs and intake declines. This relation 
is found also in the single stimulus 
method. McCleary (1953) was one of 
the first to show that the intake of sugar 
solution is limited by an osmotic post- 
ingestion factor. Others have shown 
similar effects of gastric factors by 
studying the effects of intubation upon 
intake (LeMagnen, 1955; Miller, 1957; 
Smith & Duffy, 1957; Stellar, Hyman, 
& Samet, 1954). The relative fre- 
quency of choice in the brief exposure 
preference method, however, does not 
show the inflection (Young & Greene, 
1953). Here there appears to be a 
linear relation between the level of ac- 
ceptability and logarithm of concentra- 
tion. ` Guttman’s (1953) study of the 
rate of bar Pressing as a function of 
concentration of sucrose solution used 
as the reinforcement showed that, on a 
continuous schedule, rate increased for 
the weaker ¢ 
an inversion at the higher value, Ona 


ver, the rate of 
und to be linear with 
g of the Concentration. The latter 
schedule provides relatively little drink- 
ig per response. But bar press rate 
also can be depressed by intragastric 
injections (Smith & Duffy, 1955). 
The relations between bar pressing 
on the two schedules and the preference 
ingestion data of Richter and Campbeli 
(1940) are shown in Figure 5 together 
with the clectrophysiological response 
curve for the rat (Hagstrom & Pfaff- 
mann, 1959), Note that the inversion 
point in the two-bottle preference test 
is close to the inversion point of the con- 
tinuous Teinforcement schedule. Both 
Points lie close to the top of the electro- 
Physiological sensory function. On the 
periodic Schedule, rate rises as the sen- 
Sory function increases. The fact that 
there is no inversion where the amount 
ingested is small appears to implicate 
the Postingestion factor as a primary 
stop mechanism (Collier & Siskel, 
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three sugars for the rat chorda tympani 
nerve. Ordinate is relative magnitude, 100 
equals the response to 0.01 M NaCl. (Re- 
produced from Journal of Comparative and 
Physiological Psychology.) 


1959; Shuford, 1959). Such a formu- 
lation would agree with Young’s (1959) 
statement “that sugar solutions are bas- 
ically hedonically positive for the rat as 
compared with salt which is first posi- 
tive but then negative.” 

Studies which compared the effect of 
two different sugars, glucose and su- 
crose, on consummatory behavior, choice 
response, and rate of bar press likewise 
point to the importance of postinges- 
tive factors. Isohedonic concentrations, 
equally accepted in short comparative 
tests, were not consumed in equal 
amount when presented singly for 
longer periods. The glucose solutions 
were consumed in less volume than 
their isohedonically matched sucrose 
solutions, and the cumulative mean in- 
take of both sugars was linearly related 
to osmotic pressure for the longer pe- 
riods (Shuford, 1959). In the Skinner 
box the concentration of these two sug- 
ars which give equal rate of response 
on an aperiodic schedule (i.e., have 
equal reinforcing value) corresponds to 
the isohedonic concentrations (Shuford, 
1959; Young, 1957). Guttman (1954) 
Previously showed that the equal re- 
inforcing solutions correspond to the 


equally sweet concentrations found in 
psychophysical experiments (Cameron, 
1947 ; MacLeod, 1952). 

Hagstrom and I (1959) have also 
compared the relative efficacy of sucrose 
and glucose as gustatory stimuli for the 
rat, using the electrophysiological re- 
cording method. This could be done 
even though the magnitude of response 
to sugar is relatively small. Figure 6 
shows the relative responses to the two 
sugars, and Figure 7 shows the com- 
parison of the equi-effective concentra- 
tions of the two sugars as determined in 
the bar pressing experiment, in the 
preference choice test, and by the elec- 
trophysiological measure. All measures 
agree that sucrose is more effective than 
glucose of equimolar concentration, al- 
though the quantitative relation is not 
precise, for the electrophysiological data 
is curvilinear, the behavioral data, lin- 
ear. However, it should be remem- 
bered that the physiological data re- 
flect only the chorda tympani response 
and not the taste receptors at the back 
of the mouth. Perhaps the curvilinear 
relation between the behavioral and 
physiological measures arises from this. 

Where ingestion is minimal, behavior 
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Fic. 7. Equal response concentrations of 
glucose and sucrose for electrophysiological 
measures, equal bar press rates, equi-sweet 
solutions and isohedonic solutions. (Repro- 
duced from Journal of Comparative and 
Physiological Psychology.) 
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and sensory effectiveness seem to „80 
together. Actually the postingestion 
factor seems to be especially significant 
in preferences based on ingestion. The 
glucose preference threshold is lower 
than that for sucrose and the intake of 
glucose greater than that predicted 
by the sensory measures. Soulairac 
(1947) has shown that the relative 
preference for sugars in a typical two- 
bottle preference test is correlated with 
their relative rates of absorption from 
the gastrointestinal tract. 

Experiments utilizing other learning 
situations have been concerned with 
the action of sugar or saccharin as 
reinforcers (Collier & Siskel, 1959; 
Hughes, 1957; Sheffield & Roby, 1950; 
Sheffield, Roby, & Campbell, 1954; 
Smith & Duffy, 1957: Young & Shu- 
ford, 1954, 1955). There is ample 
evidence that the sweet taste is re- 
warding whether it is nutritive or non- 
nutritive. Performance here too is an 
increasing function of stimulus con- 
centration where conditions appear to 
maximize the sensory effects and min- 

imize the postingestion factor. When 
amount or temporal factor is such that 
the absolute amount or volume of stim- 
ulus or nutrient ingested per unit time 
is low, then purely stimulus factors 
seem best to account for the results, 
When the amount consumed per unit is 
large, either because of a large amount 
of reinforcing agent or short time þe- 
tween presentations, then the secondary 
postingestive factors intrude, As Col- 
lier and Siskel (1959) point out, it 
should be possible with the Proper com- 
bination of concentration, volume, inter- 
val between reinforcement, and number 
of reinforcements to obtain a Monotonic 
stimulus function, a nonmonotonic func- 
tion with inversion points at different 
volumes or concentrations, or even no 
relation at all between amount of rein- 
forcement and performance, 


These considerations presumably ap- 
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ply to Campbell’s (1958) mesur 
ments of the JNR, the just noticeable 
reinforcing difference for sucrose solu- 
tions. He determined the minimal con- 
centration increment necessary to pro- 
duce a 75% preference for the stronger 
of a pair of sucrose solutions in a two- 
bottle preference situation. Stronger 
solutions required a larger concentra- 
tion increment than did the weak ones, 
and the plot of JNR against stimulus 
concentration showed a U-shaped func- 
tion reminiscent of the more familiar 
Weber intensity discrimination func- 
tions. The resemblance may be due 
only in part to the properties of the 
Sensory input because, as we have just 
shown, preference is the composite of 
Sensory as well as postingestion factors. 
The osmotic effects of the stronger 
sugar solutions might be particularly 
strong. This situation contrasts with 
the JNR functions for the noxious 
stimuli where the stimulus itself may 
be more directly the source of the rein- 
forcement uncomplicated by secondary 
factors (Campbell, 1958), 

hus we see that sensory stimulation 
he mouth receptors, especially of 
taste, has a direct relevance for the 
control of ingestive behavior and the 
instrumental responses which lead to 
ingestion, Although a number of in- 
vestigators have provided evidence that 
the mouth receptors may be bypassed 
and that learning can still take place, 
all such evidence shows that such learn- 
ing is not as effective as when stimula- 
tion of the mouth receptors is included 
(Coppock & Chambers, 1954) ; Miller, 
1957), Nearly all workers, whatever 
their theoretical predilections, have 
shown the importance of stimulation 
and “sensory contact,” some would say 
“sensory satisfaction” (Smith & Duffy, 
1957) in the reinforcement process. 
Indeed as Bindra (1959) concludes in 
his recent Monograph on motivation: 
“Whatever the interpretation, it seems 
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clear that, up to a certain point, an in- 
crease in sensory stimulation is a posi- 
tive reinforcer” (p. 134). 

The fact that certain taste stimuli 
control ingestion directly appears to be 
biologically determined. Frings (1946) 
has pointed out that nearly all organ- 
isms accept sugar solutions. There are 
exceptions perhaps related to certain 
aspects of metabolic or other biochem- 
ical divergences among the species. 
Cats cannot taste or discriminate sugar 
solutions (Carpenter, 1956; Pfaffmann, 
1955), and birds (Pick & Kare, 1959) 
do not appear to show strong sucrose 
preferences; but these examples are re- 
markable largely for their divergence 
from what otherwise appears to be a 
general rule. Further there is no con- 
vincing evidence that the “sweet tooth,” 
where it does exist, depends upon the 
concomitant nourishment. The drink- 
ing of nonnutritive saccharin solutions 
under prolonged exposure to them 
shows no sign of extinction such as 
might be expected if the preference for 
saccharin or “sweet” were acquired 
by past association with nourishment 
(Sheffield & Roby, 1950). Here we 
have “sweet for sweet’s sake.” 

On the other side of the coin, we 
have been able to show that a “bitter” 
aversive stimulus can be made more ac- 
ceptable only temporarily when it is 
paired with the alleviation of thirst in 
early infancy (Warren & Pfaffmann, 
1958). Newborn guinea pigs were 
raisel on a normally avoided solution 
of sucrose octaacetate (SOA) as the 
only source of water for a three-week 
period. At an older age these organ- 
isms showed the usual rejection of 
SOA. Gustatory stimuli, therefore, ap- 
pear to be biologically determined 
as the instigator of consummatory or 
avoidance responses and as primary 
positive or negative reinforcing stimuli, 

So far I have discussed the mech- 
anisms of a variety of behaviors which 
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Fic. 8. The preponderance of “pleasant” or 
“unpleasant” judgments in relation to the 
concentration of taste solution. Ordinate 
gives percentage “pleasant” minus percentage 
“unpleasant.” The abscissa is proportional 
to concentration, the full length of the base 
line standing for 40% cane sugar, 1.12% tar- 
taric acid, 10% NaCl, and 0.004% quinine 
sulphate (by weight). (Reproduced from 
Pfaffmann, 1958.) 


appear to be under the control of sen- 
sory stimulation qua “sensory stimula- 
tion.” I have not discussed the ques- 
tion of the affective or hedonic aspects 
of sensory stimulation, the pleasure of 
sensation, so to speak. To do so, I 
should now like to turn to other kinds 
of data derived from studies of man. 


Hedonic Aspects of Sensory 
Stimulation 


Of all the applied psychophysical 
fields, none have made greater use of 
the affective or hedonic rating scale 
methods, along with purely sensory 
testing procedures, than has the field 
of flavor technology. In a series of 
carefully controlled tests the Army 
Quartermaster Food Acceptance Lab- 
oratory (Pilgram, 1957), using a nine- 
point hedonic rating scale, has been able 
to predict, with good reliability, the 
actual choices of food and the accept- 
ance of menus on the part of soldiers 
in the field. These ratings frankly ask 
such questions as the following : 
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like extremely—like very much—indif- 
ferent—dislike very much—etc. 


In this case the hedonic ratings are not 
determined solely by stimulus proper- 
ties of food, for other studies by the 
Quartermaster group have documented 
the important role played by familiarity 
and past experience. None the less, the 
frankly hedonic rating initiated by the 
sensory stimulation is a good predictor 
of actual acceptance and ingestion. 
I have been impressed by the ap- 
parent similarity of our rat preference 
curves with those of hedonic value 
obtained many years ago by Engel 
(1928). In those experiments, sub- 
jects were asked to rate different inten- 
sities of taste stimuli as either pleasant, 
unpleasant, or indifferent. The data 
can be treated in a number of ways, but 
in Figure 8 we see the ratings of four 
different taste modalities expressed as 
percentage of pleasant ratings minus the 
percentage of unpleasant ratings for a 
group of seven observers using summed 
ratings of all observers. The abscissa 
is proportional to the concentrations ad- 
justed for each of the different stimulus 
solutions. Note that sugar begins at a 
slightly unpleasant value and rises with 
concentration to reach a plateau. Sour, 
bitter, and salt all start from indiffer- 
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Fic. 9. Comparison of animal preference 
and hedonic ratings of man in response 


to sodium chloride solutions. (Reproduced 
Írom Pfaffmann, 1958.) 
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ence, rise to a peak, and then fall off 
to unpleasantness. Sweet is predomi- 
nantly pleasant, bitter predominantly 
unpleasant, with salt and sour inter- 
mediate. In Figure 9 I have plotted 
the animal preference curve and the 
hedonic rating by Engel’s Ss for so- 
dium chloride solutions by concentra- 
tion. The hedonic ratings here were 
computed simply as the percentage of 
the total ratings that are pleasant. Ex- 
cept for position along the abscissa, the 
two curves show a striking similarity. 
Beebe-Center (1951) earlier called at- 
tention to these same relations and com- 
pared the animal preference responses 
with hedonic ratings. Actually the 
more recent analysis of the postinges- 
tive factors that control the intake of 
sucrose solutions has done much to 
clarify the discrepancy between Engel’s 
hedonic curve for Sugar and the rat’s 
preference for sugar. The sucrose 
hedonic curve does not turn down at 
the higher concentrations (see Figure 
8). As noted earlier, rate of bar press- 
ing on a periodic schedule and fre- 
quency of choice in the brief exposure 
test are likewise monotonically related 
to concentration. Thus hedonic rating 
and reinforcement bear the same rela- 
tion to stimulus concentration, 

In the preference-aversion curves for 
other substances, the stop mechanism 
may be sensory (gustatory) in origin 
or might arise from other than post- 
ingestive effects, If the taste of strong 
Salt solutions is aversive per se, then we 
might expect that, regardless of rein- 
forcement schedule, response will fall 
off above the optimal salt concentration 
in a manner resembling the “falloff” in 
Engel’s curves, The same might hold 
for saccharin which has a “bitter” sen- 
Sory component at the higher concen- 
trations. Further study of these effects 
is needed. 

I have been emphasizing stimulus 
Properties as prime determinants of the 
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hedonic effect, but let me hastily point 
out that specific training and experi- 
ence may make some unpleasant odors 
and tastes acceptable or even preferred. 
I am reminded here of the whiskey 
drinker’s development so aptly de- 
scribed by Brown (1955). 


Straight whiskey, when first ingested, typ- 
ically effects rather violent defense reactions. 
Because of this, the novice drinker usually 
begins with sweet liqueurs, “pink ladies,” and 
wines, and slowly works his way through 
a series of beverages characterized by the 
gradual disappearance of cola and ginger ale 
additives. Finally, only plain water or even 
nothing need be mixed with the raw product. 
To the hardened drinker, straight whiskey 
does not taste bad—not bad at all! (It is 
thus that a product euphemistically labelled 
“neutral spirits” becomes indeed psycholog- 
ically neutral). 


Although this description appeared in 
defense of a drive reduction formula- 
tion of behavior and learning, the mere 
fact that drive reduction was supported 
and buttressed by the use of sweet and 
“pleasant” stimuli seems to have been 
overlooked. There is no convincing 
evidence that the primary relation be- 
tween hedonic tone and sensory stim- 
ulation is entirely acquired. Indeed, 
quite the converse seems to be true. 
The relation of sensory stimulation 
to hedonic process has been noted by 
earlier workers. Sherrington (1906) 
noted that the stimulation of contact 
receptors, as contrasted with distance 
receptors, is characterized by strong af- 
fective tone and that the contact re- 
ceptors stand in very close relation to 
consummatory responses. Stimulation 
of the touch receptors of the lips and 
touch and taste in the mouth initiates 
reflex movements that precede the act 
of swallowing. Troland (1928) spoke 
of three classes of stimulus reception : 
nociception, beneception, and neutro- 
ception. The principal nociceptor sys- 
tem was pain, but other examples were 
hunger, thirst, the taste of bitter, strong 


salts and acids, and certain foul or 
repugnant odors, etc. In the beneceptor 
class were the sense organs mediating 
erotic behavior, the taste of sugar, and 
certain other odors. Vision, hearing, 
touch, and proprioception were rele- 
gated to neutroception. Although Tro- 
land's criterion was that of biological 
utility, Young (1936) pointed to the 
high correlation between pleasantness 
and beneception, unpleasantness and 
nociception. He also notes that there 
is a frequent correlation between pleas- 
antness and approach, and unpleasant- 
ness and avoidance ; and further that the 
affective responses to simple colors and 
tones are much weaker than those 
evoked by odors, tastes, cutaneous or 
organic stimuli. 

In his work on palatability, Young 
espouses a clearly hedonistic theory of 
reinforcement in which he treats the 
affective process as a postulate, an in- 
tervening variable (Young, 1959). The 
reactions of animals to taste solutions 
then can only be described when refer- 
ence is made to their positive or nega- 
tive hedonic effects. Thus, sugar solu- 
tions are hedonically positive, and the 
level of acceptability is directly propor- 
tional to the logarithm of the concentra- 
tion. Salt on the other hand does not 
show this one-to-one correspondence 
with sensory intensity. Low concen- 
trations of salt are said to be hedonically 
positive for the rat, but at high concen- 
trations are hedonically negative. Such 
terminology often appears simply to 
rename approach or avoidance behavior 
with hedonistic synonyms for we all 
know what we mean when we say in 
laboratory jargon that the rat likes 
sugar. We know he will take it in 
preference to water, he may press a bar 
for it, he may take it even when it is 
adulterated with something he “doesn’t 
like” such as quinine. But such usage 
does not necessarily indicate the prim- 
acy of the hedonic process; we must 
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Fic. 10. Lateral and medial surfaces of 
brains of rabbit (A), cat (B), and monkey 
(C) drawn roughly to scale. The drawings 
illustrate that the limbic lobe, represented in 


black, forms a common denominator in the 


brains of all mammals. (Reproduced from 
Journal of Neurosurgery.) 


have further experimental study of ac- 
ceptance and rejection and of their rela- 
tion to hedonic or affective processes. 

Thus it is abundantly clear that, in- 
stigation of consummatory response (or 
of rejection), reinforcement of instru- 
mental responses, and elicitation of he- 
donic effect are all closely related and 
that reproducible stimulus functions can 
be demonstrated for each. A definitive 
choice as to which of these is primary 
cannot yet be made. I would like to 
propose that sensory stimulation per se 
together with its ensuing central neural 
events be considered as a prime deter- 
minant in the chain of events culminat- 
ing in acceptance behavior, reinforce- 
ment, and hedonic effect. The further 
study of such stimulus functions and 
their analysis particularly at the phys- 
iological level is a problem that should 
merit the highest priority, 

I should like to turn to the possible 
physiological mechanisms in the rein- 
forcing and hedonic functions of stimuli, 


Tue PHYSIOLOGY or REINFORCEMENT 
AND SENSORY Arrrcr 


The studies of cranial self-stimulation 
(Brady, 1958; Olds, 1958; Olds & 
Milner, 1954) show that the reward 
and punishment systems within the 
brain itself, but particularly the reward 
systems, have been found to depend in 
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large measure upon the limbic system. 
In classical neuroanatomy this was 
known as the rhinencephalon or smell 
brain, but its relation to olfaction seems 
less and less particular. The limbic 
lobe including the hippocampus com- 
pletely surrounds the hilus of ihe 
hemisphere. Its various subcortica 
cell stations include the amygdala, 
septal nuclei, the hypothalamus, ante- 
rior thalamic nuclei, etc. The limbic 
lobe of the rabbit, cat, and monkey is 
shown in black in Figure 10. The 
limbic lobe appears to form a common 
denominator in the brains of all mam- 
mals. 


Its constancy of gross and microscopic struc- 
ture throughout the phylogeny of the mam- 
mal contrasts strikingly with the mushroom- 
ing neopallium which surrounds it... . Papez 
theorized that the experimental evidence also 
points to the limbic lobe as a cortical com- 
mon denominator for a yariety of emotional 


and viscerosomatic reactions in the mammal 
(MacLean, 1954). 


There is increasing evidence of a sen- 
Sory viscerosomatic influx into the 
limbic system, In pathological involve- 


-ments, particularly of the temporal lobe, 


discharges in this part of the brain may 
be associated with a wide variety of 
auras involving all the body senses as 
well as a great number of feeling and 
emotional states. Among the more 
purely sensory auras are crude olfac- 
tory sensations; alimentary symptoms 
including taste, thirst, hunger, nausea; 
and somatic sensations ranging from 
pains to paresthesia, Prolonged rhy- 
thmic responses can be elicited in the 
pyriform area by olfactory, gustatory, 
and painful stimulation (MacLean, 
1954). MacLean notes that all three 
of these senses accent the quality and 
intensity of a stimulus rather than its 
Spatial relationships. J 

I do not wish to give the impression 
that this system is essentially one for 
viscerosomatic sensation, for it is also 
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implicated in a variety of complex ef- 
fects such as the Kliiver-Bucy syn- 
drome (1958), memory losses (Pen- 
field, 1958), the lowering of rage 
thresholds in some cases or increased 
docility after ablation in others (Bard 
& Mountcastle, 1947; Green, 1958; 
Schreiner & Kling, 1953; Spiegel, Mil- 
ler, & Oppenheimer, 1940), and a va- 
riety of cranial self-stimulation effects 
(Brady, 1958; Olds, 1958; Olds & 
Milner, 1954). The details of these 
relations are beyond the scope of the 
present discourse. What I wish to 
emphasize is that there are significant 
sensory inputs into this system which 
may relate to the hedonic and reinfore- 
ing features of stimulation as compared 
with either the cognitive or arousal 
functions. 

Stellar and his colleagues (Sprague, 
Chambers, Stellar, Liu, & Robson, 
1958; Stellar, Chambers, Liu, Levitt, 
& Sprague 1958) at the University of 
Pennsylvania have made some interest- 
Ing studies of chronic animals with ex- 
tensive lesions in the lateral lemniscal, 
Primary sensory pathways. I was 
Struck with their description of the 
lemniscal animals, i.e. the sensorially 
restricted animals displayed little affect 
in any situation, except, perhaps the 
Most extreme. They flex and tend to 
Pull away from a pinch, but do not 
attack or show any autonomic response. 
They showed little or no aversive reac- 
tion to an ether cone, although they 
acrimated and sneezed. Prior to the 
Operation, they solicited petting and re- 
Sponded to it well ; afterwards they gave 
NO reaction to petting. 

These observations suggest that the 
affective response to sensory, stimuli 
might be mediated by the lemniscal and 
not the diffuse arousal system, which, 
it might be noted, already seems to have 
been theoretically overworked by psy- 
chologists. I would like to remind you 
Of the “thalamic syndrome,” described 


by Head and Holmes (1911). Lesions 
in the region of the thalamus are char- 
acterized by “overreaction,” by exces- 
sive affectivity, for both pleasurable as 
well as painful sensations. This is 
pathology characterized by the affective 
modification of sensation. Although the 
“syndrome thalamique” is well known 
to clinical neurologists, its exact mech- 
anism is still not clear. For our pur- 
poses, it is sufficient to note that in this 
condition, the basic hedonic responses 
to sensory stimulation, some of the 
“Pleasures of Sensation” as well as 
the displeasures, can be unmasked. In 
short, I am suggesting that the affec- 
tive consequences of sensory stimula- 
tion are mediated by processes that 
depend upon the primary projection 
systems and their ramifications in 
thalamic and old brain neural connec- 
tions. Obviously much work is re- 
quired to place these speculations upon 
a more solid foundation. 


SUMMARY 


And now let me briefly summarize. 
I alluded briefly to the different roles 
of stimuli and sensory processes in the 
behavioral economy of the organism. 
I pointed out that traditionally sensory 
processes were studied largely in rela- 
tion to discrimination or so called cog- 
nitive functions. But stimuli have been 
shown to have other neural and behav- 
ioral functions as arousal and reinforce- 
ment. I then went on to discuss in 
further detail certain of our own ex- 
periments on taste as a model system to 
show the relation of gustatory stimula- 
tion to the control of consummatory re- 
sponses and the reinforcement of in- 
strumental behavior. It was shown 
that there is increasing support for the 


‘idea that gustatory stimulation per se is 


eliciting and reinforcing be- 
ts own right. We might say 
I then led 


capable of 
havior in i : N 
“sweet for sweet s sake. 
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into the problem of hedonic or oe 
responses to such stimuli; and, althoug! 
a good correlation between affective 
response and reinforcement could be 
demonstrated, a statement as to their 
exact relation must await further study, 
particularly at the physiological level. 
Finally, I speculated as to the psycho- 
physiological mechanisms that might 
underlie reinforcement and the affective 
responses to sensory stimuli. My basic 
theme has been that sensory stimulation 
“qua stimulation” plays a significant 
role in the motivation as well as guid- 
ance of behavior—euphemistically we 
might say, in controlling behavior for 
the “Pleasures of Sensation.” 
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THEORETICAL NOTES 


THE CONCEPT OF STATISTICAL SIGNIFICANCE AND THE 
CONTROVERSY ABOUT ONE-TAILED TESTS 


H. J. EYSENCK 


Universit. 


Fae ral controversial papers regarding 
of uses and abuses of the one-tailed test 
(B Significance have recently appeared 
fee 1953, 1954; Goldfried 1959; Hick 
joz; Jones 1952, 1954; Marks 1951, 
1953). ` As Goldfried (1959) points out, 
“the important question debated is not if 
i should be used, but rather when it 
pona be used.” It is suggested here 
per most of the disagreements emerging 
1 m this controversy stem from a misun- 
Serstanding of the term “significance,” 
Ne it is further suggested that the same 
cus derstanding runs through many dis- 
ae of two-tailed tests as well. It 
whi pies suggested that in the sense 1n 
Ms Goldfried’s statement is meant, it 
as been the wrong question which has 
am debated; neither one-tailed nor two- 
ee ed tests should be used at all in the 
nse envisaged by most of the writers 
quoted, 
tet outcome of the statistical examina- 
stat of experimental results is always 
om in terms of the probability of dis- 
set mation of the null hypothesis; te 
al of values which these P values can 
tke is continuous in the interval from 
t to 1. It is customary to take arbitrary 
Values, such as .05 and .01, and use 
‘em to dichotomize this continuum into 


a $ is Co p 
Stgnificant and an insignificant portion. 
advantage, if 


Te, habit has no obvious 
hat is intended is merely @ restatement 
cae probability values Hee ee al ' 
meee given in any case an ; ee Me 
State, precise than a simple dichoton 
ement, Indeed, gross absurdities re- 
Ut from taking these verbal statements 


Spee iffer between a 
usly; the difference DNC ii 

C.R. of 1.90 and another of 2.00 iS quite 
one side 0 


Negli 3 

gible, yet one falls on 

the dichotomy, the other on the other 
This has led to such summary 


Ide, 


y of London 


statements as: “almost significant,” or 
“significant at the 10% level.” If the 
verbal dichotomous scale is not satis- 
factory—as it clearly is not—the answer 
surely is to keep to the continuous $ 
scale, rather than subdivide the verbal 
scale. 

However, surplus meaning has accrued 
to the word “significant,” and it has be- 
come a shibboleth which divides the suc- 
cessful from the unsuccessful research. 
Tt is frequently interpreted as almost 
guaranteeing reproducibility of results, 
while failure to reach significance is in- 
terpreted as disconfirmation. Hence the 
urgent desire to achieve respectability and 
significance by one-tailed tests, if need 
be, and the argument regarding when the 
cachet of “significance” can be bestowed 
upon a research result. Yet the argu- 
ment, and the achievement or nonachieve- 
ment of significance, do not alter the 
facts of the case, which are contained in 
the statement of the p value of the re- 
sults. Anything beyond these facts de- 
pends upon interpretation, and is sub- 
jective; it does not alter the facts of the 
case in the slightest. 

As an example of the necessity of inter- 


pretation, consider the a priori proba- 
bility of the conclusion. Suppose that an 
e carried out with 


experiment on ESP wer } 
all the precautions which human ingenu- 
ity can devise, so that even the most 
sceptical had to agree that no fault could 
be found with the experimental design. 
Suppose also that a p value of 05 were 
achieved. Would this be considered “sig- 
nificant,” in the sense of guaranteeing 
reproducibility ? Critics would point out 
quite rightly that where the a priori 

q very low, as in this case, 
values would be required 


probability is 
much higher > a 
ificance. Logicians are 


to carry sis? 
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at interpretation of experimental 
ae cet all on all available ‘nowl- 
edge about the subject in Tenon & 
priori probability is a kind of summary 
statement of much of this knowledge. It 
cannot be overlooked in arriving at a 
conclusion regarding “significance” when 
the term carries the surplus meaning in- 
dicated. i 2 
That interpretation comes jnto the 
problem very much is clear when we look 
at such conditions as those suggested 
by Kimmel (1957) as criteria for the 
use of one-tailed tests. He suggests, for 
instance, that they may be used if re- 
sults in the opposite direction would be 
psychologically meaningless or could not 
be deduced from any psychological theory. 
These are obviously not objective cri- 
teria, but depend on what the author (or 
reader) considers psychologically mean- 
ingless, or the kind of theory he may 
hold. Opinions will differ, and conse- 
quently some readers will agree to the 
use of the one-tailed test in a particular 
case, others will not. Thus to some read- 
ers the results will appear significant, to 
others insignificant, 
The whole argument seems to be about 
words, not about facts: Is the word “sig- 
nificant” to be used in a given situation, 
i is would only matter if 
some objective mean- 
in the probability fig- 
ures; we have argued that it does carry 
surplus meaning, but that this is not of 
an objective kind. Consequently, nothing 
important is changed by omitting the 
term altogether in the report, leaving in- 
terpretation to the reader. After all, the 
only true proof of reproducibility js Te- 
production! Verbal assertions of “sig- 
nificance” have no more 


meaning than 
the droit du pour at the court of Louis 


The solution is to separate quite clearly 
and decisively the objective statement o 
the probability of disproof of the mull hy- 
pothesis (by means of a two-tailed test) 
and the subjective evaluation and inter. 
pretation of the results. The reader 
would be able to accept the first state- 
ment as a statement of fact and would 
then be able to judge for himself the ar- 
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guments presented by the author regard- 
ing the meaning of these facts. These ye 
guments might be based on Tesults Gr 
previous experiments, predictions made 
on the basis of more or less widely ac- 
cepted theories, number of cases involved, 
a priori lack of accepability of the con- 
clusions, and other similar grounds; an 
explicit statement of the arguments would 
enable the reader to decide for himself 
the acceptability of the conclusions in a 
manner precluded by the simple state- 
ment of one-tailed probability. A state- 
ment of one-tail probability is not a state- 
ment of fact, but of opinion, and should 
not be offered instead of, but only in 
addition to, the factual two-tailed proba- 
bility; if it is offered at all, it should be 
accompanied by a full statement of the 
arguments in favor of its facilitating a 
more meaningful interpretation of the 
data. In the writer's opinion, it would 
be better to drop such statements of one- 
tailed probability altogether and rely en- 
tirely on appropriate argumentation to 
establish the meaning of the observed 
(two-tailed) Probabilities, 

Implicit in this recommendation is the 
corollary that the mechanical evaluation 
of experimental results in terms of “sig- 
nificant” and “not Significant” he dropped 
outright, Interpretation is implicit in the 
statement of one-tailed Probabilities, but 
it is also implicit in the Statement of two- 
tailed Probabilities if these are auto- 
matically interpreted as being significant 


with all the surplus 
these terms. The 
ve his (two-tailed) 
Oceed to argue re- 
ility of the conclu- 
already indicated. 

here have appeared in the literature 
solemn discussions about the possible 
causes for discrepancies between two ex- 
periments, one of which gave significant. 
the other insignificant results: yet the 
Tespective £ values were almost identical. 
one lying just on the one side, the other 
just on the other side, of the arbitrary 
5% line. Such arguments are unrealistic 
and would he avoided if p values were 
compared, rather than verbal statements. 
WO experiments giving p values of .048 


i“ h 
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= -056 are in excellent agreement, al- 
though one is significant, while the other 
1s not, 
BS i ag the main point of this 
“Stacia WE would say that verbal 
at ae regarding “significance” are 
incony supererogatory restatements in an 
APA any dichotomous form of results 
continny properly stated in terms of a 
ice of p values; at worst 
of y carry unjustified surplus meaning 
an entirely subjective kind under the 
tag an objective and mathematically 
rip statement. Subjective judg- 
abl t of reproducibility cannot reason- 
tion be based on the mechanical applica- 
fuli of a rule of thumb whose only use- 
ess lies in the elementary instruction 
OF undergraduates lacking in mathemati- 
cal background; if they are to be made at 
all they demand complex consideration of 
a priori probabilities. It is suggested 
that the accurate and factual statement 
Probabilities (two-tailed) should be 
Mandatory and that all subjective con- 
«iterations, arguments, and judgments 
oe be clearly separated from such 
Actual statements. It is implied that 


271 


judgments of “significance” belong with 
the subjective side, and it is also implied 
that the calculation of p values on the 
basis of one-tailed tests has no place in 


psychology. 
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Recently, Mathews and Wertheimer 
have derived a theorem from the Kohler- 
Wallach (1944) theory of figural after- 
effects in order to provide an experimen- 
tal test to discriminate between that 
theory and the Osgood-Heyer (1952) 
theory. This test must fail, however, be- 
cause the principle that is invoked is 
contained in both theories, and the same 
prediction is at least as readily derived 
from either. Instead, the contribution of 
Mathews and Wertheimer lies in drawing 
attention to some commonly overlooked, 
but logically necessary consequences of a 
satiation-type theory. The rule that dis- 
placements will be uniformly symmetrical 
with respect to the contours of any I- 
figure has come to be accepted as a gen- 
eral statement of Kohler’s theory and 
empirical findings. For example, Sagara 
and Oyama (1957) reported several ex- 
periments in which asymmetrical dis- 
placements were found as a contradiction 
of Kohler and Wallach (1944). The 
interpretation that Kohler and Wallach 
gave for this same finding in their orig- 
inal paper is rejected by Sagara and 
Oyama as a special ad hoc suspension of 
the rule of symmetry. In part, the object 
of the following analysis is to show that 
asymmetrical displacements are a logically 
necessary consequence of Kohler’s as- 
sumptions in a wide variety of situations, 
while the rule of symmetry only applies to 
a restricted class of figural aftereffects. 

The satiation of Kohler and Wallach 
and the neural adaptation of Osgood and 
Heyer are processes which arise from the 
contours of a figure when it is seen 
against a contrasting field. Satiation 
spreads to regions about the contours of 
the figure in a manner determined by elec- 
trotonic field-forces assumed to exist in 
the visual projection area of the cortex, 
Neural adaptation spreads about the con- 
tours in a manner determined by the 
statistical spread of neural excitation in 


the visual cortex caused by “the simul- 
taneous action of physiological nystagmus 
and reciprocal overlap of dendritic proc- 
esses” (Osgood & Heyer, 1952, p. 102). 
As Osgood and Heyer freely admit, once 
each theory has accomplished its genu- 
flections to the particular sort of ritual- 
istic physiological speculation favored by 
its authors, the results are essentially 
equivalent. With time, a gradient of 
satiation (or neural adaptation) is formed 
about the contour. The gradient is high- 
est directly over the contour and decreases 
with distance in either direction, tending 
to be sigmoid in shape due to an assumed 
flattening at the peak. In general, dis- 
placements will be from regions where 
the gradient is high to regions where it is 
low. The derivation up to this point is 
illustrated in Fig. 1, where CD represents 
a line presented as an I-figure, P and P’ 
represent the loci of T-figures presented 
after CD has been removed, and the ar- 
rows indicate the direction of displace- 
ments. If either P or P’ js brought too 
close to CD, the gradients will extend 
symmetrically in both directions so that 
no displacement can occur, The extent of 
displacement has been found to increase 
with distance from the contour up to a 
maximum and then decrease again (the 
distance paradox). To account for this, 
both theories assume that the extent of 
displacement is a complex function of the 
height and the steepness of the gradients. 
Such symmetry of direction and extent, 
with CD representing the axis of sym- 
metry, can be expected only as long as the 
I- and T-figures are analogous to the 
diagram of Fig. 1 which will be referred 
to here as Paradigm I. The analogy 
will continue to hold when the I-line, CD, 
is widened to become an oblong or square, 
but only so long as the T-figure does not 
become appreciably smaller than the I- 
figure. 

Paradigm I is the illustration most 
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Fic. 1. Paradigm I. 


frequently mentioned in the theoretical 
“iscussions of both Kohler and Wallach 
and Osgood and Heyer. Also, analogies 
Paradigm I comprise the more in- 
aM obvious and striking of the vari- 
ua demonstrations reported by Kohler 
lial Wallach, as well as those most re- 
Roy confirmed by subsequent investiga- 
ee _ Thus, it is understandable that 
ae simple paradigm has come to be ac- 
mered as the general model of figural 
aftereffects according to satiation theory. 
S Kohler would say, “Nothing could be 
arther from the truth.” Consider the 
‘agram in Fig. 2. Here the square 
Ser CD represents a solid L-figure pre- 
l ated against a contrasting field (e.85 
sack square on white field). The point 
br represents the locus of a T-figure ap- 
Preciably smaller than ABCD that is 
resented outside the contours of the T- 
‘Sure and near the contour CD, while 
Tepresents the locus of a similar T- 
cette presented near CD but within the 
Ontours of the I-figure. The solid ar- 


1R, : at, 
bee, tensive reviews of this literature have 
gi? Published recently by McEwen (1958), 


os and Oyama (1957), and Spitz 
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rows represent the symmetrical displace- 
ments on either side of CD that would be 
predicted merely by superimposing the 
T-line of Paradigm I on the new diagram. 
Clearly, this would not be justified. Un- 
less the I-figure is so wide or the I-period 
is so short that the tissue satiating cur- 
rents flowing from the opposite contour are 
negligible, displacement at Q should not 
be symmetrical with displacement at P. 
Without doing violence to the rest of 
the theory it is necessary to suppose that 
the satiating currents arising from the 
opposite contours, AB and CD, combine 
with each other in many cases. As a 
result, the satiation at loci inside the con- 
tours will always be greater than that at 
an equal distance outside the contours. 
At first, the cross sectional pattern of 
satiation will be saddle-shaped with the 
level of satiation in the cleft of the saddle 
rising more rapidly than that of the out- 
ward sloping gradients. With time a 
relatively flat dome-shaped pattern will 
emerge which spans the distance between 
the contours. Eventually, a homogeneous 
degree of satiation will be obtained be- 
tween the contours and extending for 
some distance beyond them (Kohler & 
Wallach, 1944, p. 343-344). A precise 
statement of the rate at which each stage 
will appear and then give way to the 
next is beyond the scope of the theory in 
its present form. Nevertheless, with a 
suitable choice of I-figure, T-figure, and 
T-period, it should be possible to find a 
locus Q between the area of greatest sati- 
ation and the contour CD such that the 
direction of displacement will be towards 
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and Wertheimer (1958). 


CD. This is indicated by the broken 
arrow in Fig. 2. 

Despite the claim of Mathews and 
Wertheimer (1958), which they have 
documented with a personal communica- 
tion from Osgood, precisely the same 
theorem can be derived from the Osgood- 
Heyer theory. Osgood and Heyer (1952, 
p. 108) explicitly state that the statistical 
distributions of neural adaptation arising 
from opposite contours must overlap and 
summate with each other in a manner 
analogous to the satiation of Kohler and 
Wallach. Their Fig. 2 (1952, p. 105) 
is a cross sectional representation of 
Paradigm I. It becomes Paradigm II 
merely by superimposing upon it the 
overlapping gradient of neural adaptation 
which might be expected from a second 
contour. If this is done, the predictions 
of the Mathews-Wertheimer theorem 
must follow. Of course, to obtain a 
prediction for a specific I-figure it will 
no longer be possible to avoid specifying 
the parameters of the statistical distribu- 
tions as it was in the original analysis 
presented by Osgood and Heyer. 

Mathews and Wertheimer (1958) per- 
formed an experiment which they de- 
signed as a test of their theorem. The 
T- and T-figures they used are drawn to 
scale from their description in Fig, 3, 
The I-figure was a solid, black, 2-in. 
square corresponding to ABCD in Fig. 2, 
while TL, TC, and TR were three 44-in, 
squares presented simultaneously as a T= 
figure after a 2-min. inspection of the T- 
The locus of TL corresponds to 


figure. 
locus Q in Paradigm II. Each of 16 
observers (Os) was asked to report 


which distance was greater, that between 
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TL and TC or that between ‘TC and TR, 
and 14 chose the right hand distance as 
the greater. According to Mathews and 
Wertheimer: “This result favors the 
Kohler-Wallach prediction, by bimodal 
expansion, at the 1% level.” Since no 
further data are submitted, we must sup- 
pose that this probability level was ar- 
rived at by assuming that 16 other Os 
who had not inspected the I-fgure—that 
is, a control group—would have been 
equally divided in their choices. Un- 
fortunately, those of us who do not con- 
cur in this admittedly common practice 
of assuming the performance of a control 
group must remain in doubt as to the out- 
come of the experiment. Further elabo- 
ration of this point is unnecessary as the 
importance of such a control in the study 
of figural aftereffects has already been 
discussed at some length by Brown 
(1953) and Wertheimer (1954). 

In any event, even with a control 
group, the particular T-figure chosen by 
Mathews and Wertheimer could never be 
used to study the displacement of the 
square TL. All objects in the T-figure 
must be assumed to exhibit some dis- 
placement. Consequently, the best pos- 
sible measurement of the extent or direc- 
tion of displacement is the relative 
displacement of a near T-object when 
compared with that of a second, remote 
T-object. When the relative displace- 
ments of both the near and the remote T- 
objects are referred to that of an inter- 
mediate, third T-object no meaningful 
result can be obtained. The Os could 
report that the distance TC—TR is 
greater than the distance TL—TC when 
both TL and TC are displaced away from 
the contour in any case where the dis- 
placement of TR is greater than the com- 
bined displacement of TL and TC. In- 
deed, this might occur very commonly 
due to the distance paradox. Similarly: 
the distance TL—TC could be reported 
as greater even when TL is displaced 
towards the contour in any case wit 
the displacement of TC away from t 
contour is greater than the gambi 
displacement of TL and TR. The uit 
uitous distance paradox would rea 
account for this outcome also. 
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Fortunately, Fox (1951) has already 
reported two technically more adequate 
&xperiments that fulfill the topological 
rduirements of the Mathews-Wertheimer 
uaa In Experiment I, Fox found 
or ; “figures presented on the contour 
a Just inside of it were displaced out- 
ards, but to a smaller extent than T- 
gures Presented outside the contour. 
FIE, Somewhat different I-figures, in 
Xperiment IT he found that displace- 
rose were symmetrically away from the 
eit est contour for a variety of loci on 
in £" side of the contour. Fox concluded 
ae of Kohler and Wallach on the 
asym of Experiment TE attributing the 
an aracttical results of Experiment I to 
nes tifact not present in Experiment IT. 

Might at first appear that in order 
fee the Mathews-Wertheimer 
clusion” Fox should have based his con- 

n on the results of Experiment I 
sults gy ained away the symmetrical re- 
Would, “Xperiment IT. But this, too, 
deresti Oversimplify the situation and un- 
theor imate the full power of the satiation 
o a Clearly, both the initial pattern 
tion tion and its variation as a func- 
z time must depend upon the par- 
-figure that is in question. In 
plex situation described by Para- 
OF simp, 07'S results can be accounted 
Use nply by noting that the I-period he 
dug Must have been long enough to pro- 
the asymmetrical displacements with 
~ and T-figures of Experiment I, 
With those of Experiment II. 
resents athews-Wertheimer theorem rep- 
also > Case 1 of Paradigm II. It is 
2, Possible to derive its converse, Case 
loci sae the direction of displacement at 
time Side the contours reverses with 
nder certain conditions. That this 
Satin tO is implied as a consequence of 
Alla theory was noted by Kohler and 

ach (1944) : 
Pose t 


half 


hat the T-squares lie half inside 
D Osit 


Outside the area of the I-oblong [on 
s contours], If the inspection period 
Sq too short the distance between these 
Will decrease. Even with T-squares 
ee entirely outside the oblong but 
Jacent to it the same will be observed 
S Provided that the inspection period 
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has been sufficiently long... the direction 
of the effect depends upon the degree in 
which the medium has been altered by pre- 
vious inspection. Also, during late stages of 
the inspection period a figure process may 
not affect the medium in quite the same way 
as it does in the beginning. Incidentally, in 
the same arrangement slight variations of 
the I-object may turn the effect in one direc- 
tion or the other (p. 297). 


The existence of Case 2 displacements 
towards the contour that Kohler and Wal- 
lach reported have since been confirmed 
by Smith (1954) and by several Japanese 
studies reviewed by Sagara and Oyama 
(1957). The explanation seems to de- 
pend upon the existence of I-figures which 
give rise to gradients which are steeper 
inside the contour than outside the con- 
tour. Both Kohler and Wallach (1944, 
pp. 339-340) and Osgood and Heyer 
(1952, p. 108) invoke this principle to 
account for an otherwise anomalous dis- 
crepancy between outline and solid fig- 
ures. In such a case the peak will be 
shifted outwards somewhat, and further 
summation and interaction with the gra- 
dient extending from the Opposite con- 
tour should increase this shift. The 
effect will not always be observed, of 
course; but with a suitable choice of I- 
figure, T-figure, and I-period, it should 
be possible to find a locus p outside the 
contour such that displacement will be 
towards the contour. In connection with 
this, Sagara and Oyama (1957) report 
that where Case 2 reversals were found 
the extent of nonreversal displacement 
was greater inside the contours than out- 
side. This is precisely the result that 
would be expected if the inside gradient 
is steeper than the outside gradient in 
Case 2. 

There is essentially no contradiction 
between the predictions of Case 1 and 
those of Case 2. Each is appropriate to 
a separate class of I-figures. It is in- 
tuitively obvious that different classes 
of I-figures must give rise to different 
patterns of electrical activity in the visual 
cortex. To claim otherwise is to do vio- 
lence to language. However, the various 
patterns are related by a common neuro- 
logical mechanism. According to Kohler 
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and Wallach, this mechanism is the elec- 
trotonic field-forces of satiation. Accord- 
ing to Osgood and Heyer, it is the statis- 
tical distribution of neural adaptation. 
Neurological evidence has been cited on 
both sides. To be sure, the neurological 
data available to the authors were not 
sufficient to permit either theory to spe- 
cify the particular pattern of cortical 
activity that would be produced by any 
particular set of conditions. This is un- 
derstandable when we consider that the 
evidence that is cited pertains to phe- 
nomena that take place within fractions 
of a second, while figural aftereffects can 
extend over several minutes at least. 
But, with disarming frankness, both Koh- 
ler and Wallach and Osgood and Heyer 
admit the need for much further research 
in this area. In the interim, the gap is 
bridged by boldly assuming that in cases 
of Paradigm I the long-term neurological 
processes will be analogous to the short- 
term processes, while in cases of Para- 
digm II summation and interaction over 
time will produce precisely the gradients 
that are necessary to account for the 
figural aftereffects that occur. Thus, 
whether the neurophysiologists of the 
future opt for field-forces, or statistically 
independent units, or—as in modern 
physics—some happy combination of the 
two, it is probably safe to say now that 
perceptual phenomena are related to neu- 
ral activity. 
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CLASSIFICATION AND STATISTICS 
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inta e Classification of psychological scales 
catego minal, ordinal, interval, and ratio 
Significar, by Stevens (1951) has been a 
a pont contribution to psychophysical 
tion Rec rement theory. This classifica- 
by a been widely accepted and applied 
aiid chologists in conversation, teach- 
1959. nd in publication (e.g, Edwards, 
Siegel Guilford, 1954; Senders, 1958; 
s teven 956). In considering these scales 
tistical has specified the appropriate sta- 

Measures for use with each. This 


latter i 
act aspect has had a tremendous im- 
exampl writers of statistical texts. For 

<n ple, these categories form the basis 


o 
f ae texts by Senders (1958) and by 
this « (1956), There is no doubt that 
lassification has great practical and 
a Heal value, However, we feel that 
ieblication of this classification has 
to the $ e misunderstanding in regard 
and to Se of various statistical techniques 
nonparar. overemphasis on the utility of 
icaj o ametric techniques in psycholog- 
research, 
tose example of this tendency Siegel 
Metri + 26) maintains that nonpara- 
With g as of significance should be used 
"equir Ubinterval type data. He lists the 
Seoti of interval scale as one of the 
Pata et for the use of the analysis of 
Not ce. However, this assumption can- 
Ma a oe if one looks to the math- 
1947) bases of assumptions (Eisenhart, 


Tn fact a noted statistician, Kemp- 


le 
led 


Tee 
ically (1955), after showing mathemat- 
Varj that the normal theory analysis of 


la, 

domin® test can approximate the ran- 
Th ‘tion test, states that 

is i 

the sass as some theoretical basis for 
fatistic; Which has been noticed by most 
pe ata that the level of significance of 
ste sees of variance test for differences 
[vice a treatments is little affected by the 
italics added of measurement for analysis 


The important consideration for the use 
of the analysis of variance is not that the 
data have certain scale properties but that 
the daia can be related to the normal dis- 
tribution, plus approximating the other as- 
sumptions of independence and homogen- 
eity of errors. Furthermore, it must be 
emphasized that the normal distribution 
refers to the distribution of those portions 
of the data which are used as the appro- 
priate estimates of error and not neces- 
sarily to the distribution of the observa- 
tions. This is an important consideration 
because in multivariable designs the var- 
jation within each group may not dis- 
tribute normally but the portions of cer- 
tain interaction terms may, In many of 
these designs some of the interaction 
effects provide the valid estimate of error 
in testing certain effects. 

Tt would seem that the comments con- 
cerning the relation between the various 
scales and the appropriate statistical pro- 
cedures should serve merely as a guide. 
In some cases empirical data may indicate 
that rigid adherence to such statistical 
methods is not required and wasteful of 
the data. The author has propounded 
elsewhere this viewpoint and cited rel- 
evant data concerning the analysis of 
variance technique (Gaito, 1959). 

Furthermore, the same data may be 
considered to have the properties of two 
or more scales, depending on the context 
in which it is considered. For example, 
if we look at the response of one subject 
(S) to a single item, the properties of the 
data are those of a nominal scale, i.e., 
right or wrong. However, if we concen- 
trate on the total score for one S or the 
total scores for a group of Ss, we have at 
least an ordinal scale. This is similar to 
the situation of approximating the bi- 
nomial distribution (a distribution hav- 
ing nominal, or possibly ordinal, scale 
properties) by the normal distribution (a 
distribution having at least ordinal, pos- 
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sibly interval, scale properties) when a 
increases and p is close to .5. Another 
example is the relation between the sign 
test and the binomial distribution test. 
For a given set of data these two methods 
will give the same result; yet Siegel lists 
the former as of an ordinal nature, the 
latter as nominal data. 

It appears that in dealing with descrip- 
tive statistics the comments made about 
the use of nonparametric statistics for 
subinterval-type data are most appro- 
priate. However, in dealing with tests of 
significance such statements are not as 
appropriate, even though descriptive sta- 
tistics are involved in the procedure. For 
each test the mathematical requirements 
are expressed in the assumptions, and 
nothing concerning specific scale prop- 
erties is required. Supplementing the 
requirements will be evidence to indicate 
the effects on the data of failure to meet 
the assumptions (Gaito, 1959; Lindquist, 
1953). Thus if empirical research indi- 
cates little effect on the data when as- 
sumptions are not met or merely approx- 
imated, the investigator may use a para- 
metric technique and allow for the effects 
of the deviations from assumptions in his 
interpretation. 

It is encouraging to note that some 
individuals have been reluctant to em- 
brace wholeheartedly the nonparametric 
technique (e.g., Guilford, 1954; Mc- 
Nemar, 1957, 1958; Savage, 1957). In 
a review of the statistical literature in the 
Annual Review of Psychology, Grant 
(1959, p. 137) appropriately summarizes 
this position by the comment: “Some 
much-needed negative thinking has re- 
cently appeared on nonparametric tech- 
niques” [italics added]. 
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In conclusion we wish to restate that 
the statements concerning scale proper- 
ties and statistical procedures should be 
guides and that context, mathematical as- 
sumptions of statistical procedures, and 
the results of research concerned with 
failure to satisfy assumptions should be 
the ultimate determiners of the choice of 
statistical techniques. 
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fe importance of a concept of the 
Need ig in psychology does not 
all, Si be documented. Most, if not 
o the theories of personality em- 
Such di Sonten eat their treatment ot 
anisms verse problems as defense mech- 
and ah: Motivation, psychopathology, 
Dory chotherapy. Despite the im- 
Progress of this concept little or no 
objergp ; 25 Peen made in refining and 
the ying it. The current treatment 
aX ae es is little if any more 
Cept on than the loose animistic con- 
Rives rte by Fre l. This looseness 
anq ec to excessive surplus meaning 
hg i ly powers (occasionally border- 
Uneritie me mystical) are glibly and 
TOCes ly attributed to this pervasive 
exputile there has been a resurgence of 
Jl aie interest in unconscious 
lena, this research for the most 

tionay c's to be primarily demonstra- 
The ' nature rather than parametric. 
Oper, CMplexity of the experimental 
nities OMS affords numerous opportu- 
Tesult, or attifacts to occur and the 
l E generally overinterpreted. 
Attems latter connection little or no 
Pts are made to relate the phe- 


S Dhi 
rant Mt Paper was facilitated by Research 
c -1206, National Institute of Mental 


aly 
ù Publie Health Serv 


i 
nomena to the general body of experi- 
mentally derived psychological knowl- 
edge. 

This paper presents a representative 
survey of experimental approaches to 
unconscious phenomena with the pur- 
pose of examining methodological and 
definitional problems that are en- 
countered. An attempt is made to 
separate different phenomena that have 
been lumped together under the general 
concept of the unconscious and to 
treat these phenomena in terms of cur- 
rent laboratory derived concepts. 


PROBLEMS OF DEFINITIONS 


The nonscientific nature of current 
conceptions of the unconscious is at- 
tested by the vagueness of definitions of 
what constitutes conscious and un- 
conscious behavior. Not only are the 
definitions vague and operationally in- 
adequate but there are several dif- 
ferent terms that are used, sometimes 
to designate the same phenomena, other 
times in a partially nonoverlapping 
way. Such terms as “consciousness,” 
“awareness,” “insight,” and “attention” 
have all been used to describe the same 
phenomena. I do not intend to trace 
the differences or the commonalities 
among these different terms and their 
application to specific problems but 
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merely to indicate the nature of the 
confusion that exists. In the remainder 
of the paper I shall use the terms 
awareness and unawareness to desig- 
nate what is generally subsumed under 
the psychoanalytic terms, conscious and 
unconscious. I prefer the term aware- 
ness since it seems to reflect more 
closely the psychological quality of 
the phenomena whereas consciousness 
seems more appropriate for the desig- 
nation of the state of physiological 
alertness of an organism. 

The most frequent definition of 
awareness and unawareness is in terms 
of verbal report. Awareness is equated 
with the ability to verbalize and un- 
awareness with the inability or lack of 
verbalization. Freud (1925) in his 
extensive writings can be found on oc- 
casion to have defined awareness and 
unawareness in this manner. Such a 
distinction has been popular with other 
writers on the subject with Dollard 
and Miller (1950) most explicit. They 
define unawareness as the unverbalized 
or the unlabeled and their distinction 
between aware and unaware processes 
follows from their conception of the 
role of verbalization in the individual’s 
adjustment. Verbal labels are assumed 
to increase the precision of discrimina- 
tion and to mediate the transfer of 
training through the cue producing 
characteristics of the labels. 

The popularity of a definition of 
awareness in terms of verbal report is 
understandable. It creates the illusion 
of operational precision and helps to 
mask the fact that many of the more 
important characteristics of awareness 
are avoided by the definition. Such 

an equation might seem a simple solu- 
tion to the definitional problem but it 
is to be noted that the operational 
meaning of such a definition uf any 
given experimental situation hinges 
critically upon a number of considera- 
tions. Among these we need to con- 
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sider the adequacy of the questioning 
of the subject (S), the motivation. of 
the S to respond with the care and 
precision that is required, the care 
taken to assure that the S understands 
what is being asked him, consideration 
of the effects of the interrogation itself 
upon the delicate process of awareness, 
and most importantly an adequate 
schema for classifying the S’s verbaliza- 
tions along relevant dimensions. As- 
suming the last criterion can be met, a 
further requirement is some system of 
scaling or grading the S’s verbaliza- 
tions along these dimensions in terms 
of their accuracy or specificity. It is 
not sufficient, as is too often the case, 
for the experimenter (E) to make a 
general statement that interrogation of 
his Ss revealed that they were unaware. 
It is the Æ’s decisions with respect to 
the above considerations that constitute 
the adequacy of his operational defini- 
tion of awareness and any value the 
concept has depends upon the similarity 
of these operations from one experi- 
ment to another, 

While attention to the above require- 
ments constitutes a minimum standard 
for a definition of unawareness, it also 
is to be noted that there are some basic 
objections to the equation of awareness 
with a verbal report. There is nothing 
profound in noting that language and 
words are highly abstract symbols and 
bear little or no physical resemblance 
to the objects, events, and relationships 
that they denote. A definition of 
awareness in terms of verbalization 
places a heavy burden upon the ade- 
quacy of language to reflect the richness 
of perceptual experiences and images: 
The equation of awareness with ver nee 
report leads to a similar kind of oon 
founding of response’ and concept fie 
has occurred with perception and ie 
responses from which perception i 
inferred (Garner, Hake, & Erikse™ 
1956). 


DISCRIMINATION AND LEARNING WITHOUT AWARENESS 


There is also the question of how 
pec a definition would handle volun- 
ary responses other than verbal report. 

us in an experiment on complex 
pe discrimination an S may be 
Bimulue o to describe verbally the 
But he s that has been presented to him 
rect sti ar accurately choose the cor- 
ton š Si us ma multiple-choice situa- 
Hon of = eae one’s verbal descrip- 
individ facial appearance of another 
a? may be wholly inadequate in 
=... another to recognize this 
ee among a crowd. However, if 
he giving the description pos- 
Would $ mediocre artistic talent, it 
Sketch De possible for him to draw a 
would of the person described which 

A aty increase the chances of 
rae oe s being recognized. In 
Da ee tage if we adhered strictly 
boi of awareness in terms of 
say ag it would be necessary to 
ae the S was not aware of what 
like ge perceptual stimulus was 
Pearane of the described person's La 
feel ty ia However, most of us wil 

Nat this is failing to capture what 


eb when we describe lack of 
ness, 
bal 3 equation of awareness with ver- 
et also would exclude from 
e aon a large body of experi- 
Wit data that has supposedly dealt 
atte problem of awareness. As 
the n (1957) has pointed out, one of 
er} Ost Teadily demonstratable ex- 
Sho, Sal arrangements supposedly 
In TOE unawareness is the following. 
minag l Ychophysical threshold deter- 
P °n if the S is instructed to give 
lüdgm dence rating along with his 
judg, Ms, it is frequently found that 
Pure sents which the S reports as 
“imine iesses show above chance dis- 
with re tion. Tf we equate awareness 
Yon “thal reportability, however, we 
ave to classify this as dis- 
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crimination with awareness since the S 
is making a discriminated verbal report. 
I have raised these objections and 
problems concerning the definition of 
awareness, not because I have a solu- 
tion for them or a better definition to 
offer, but solely to insure that the 
issues raised in the remainder of this 
paper will be evaluated in terms of the 
limitations of our current definitions. 
I believe that in view of our present 
knowledge in this relatively unexplored 
research area, a definition of awareness 
in terms of a verbal report is a begin- 
ning point. But as was pointed out 
above any value such a definition has 
depends critically upon the operational 
specifications used in assessing the Ss’ 
verbalizations or lack thereof. 


DISCRIMINATION WITHOUT 
AWARENESS 


The question of discrimination with- 
out awareness is two separate questions 
with quite different implications. When 
we talk about discrimination without 
awareness we might mean : (a) is there 
a more sensitive (accurate) discrimina- 
tion that occurs at levels not available 
to awareness; or (b) are we aware of 
all the discriminations that we make. 
The first question concerns the pos- 
sibility that our nervous system dif- 
ferentially responds to stimuli or cues 
whose intensity is below that where a 
differentiated or discriminated verbal 
report can be obtained (subliminal 
discrimination). The second question 
concerns the possibility of differential 
responses to stimuli or cues above 
thereshold but which upon questioning 
the S is unable to state that he used 
these particular stimuli as cues for his 
responses (incidental discrimination), 
These two questions are seldom dis- 
tinguished and it is not uncommon to 
find a psychologist arguing the af- 
firmative answer to the first question 
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by citing experimental data that only 
demonstrate the latter. 

When discrimination at levels below 
that available to awareness is explicitly 
considered, it is often assumed that the 
discrimination occurring at these levels 
is only of a rather gross type. Cason 
and Katcher (1933) give a very plau- 
sible neurophysiological speculation in 
terms of how such subliminal stimuli 
could operate. They state: 


Before a conscious activity comes into 
existence, the energy that is produced or 
liberated in the body must be a certain 
required strength. The nervous energy 
frequently leads to some bodily effect even 
though its strength is not sufficient to 
produce conscious activity. It seems reason- 
able to believe that energy of this subliminal 
strength can be connected with some re- 
sponse by the learning process and if this 
could be demonstrated the theoretical im- 
plications would be important (p. 831). 


Lazarus (1956) in defending his 
interpretation of the subception effect 
has suggested that discrimination at 
levels below that available to awareness 
as measured by the GSR, is capable of 
making gross discriminations between 
threatening and nonthreatening stimuli. 
Osgood (1957) has made a similar 
suggestion for interpreting perceptual 
sensitization and defense data. He 
conceives that an overall affective eval- 
uation occurs early in the perceptual 
process and 


that hints provided by the incomplete 
information in the stimulus display, by 
eliciting the dominant evaluative (emo- 
tional) components of mediating reaction 
feed cues back to the perceptual 
system, may produce either facilitation of 
covaluent integrations (vigilance) or dis- 
organization of integrative mechanisms 


(defense). 


which 


He further suggests that this process 


may be mediated by lower brain 


centers. 
Conceiving of discrin 
below awareness as only 


aination at levels 
a rather gross 


CHARLES W. ERIKSEN 


type of discrimination has a certain 
plausibility to it. It avoids the concept 
of the “unconscious” as a sort of super- 
natural entity with mysterious powers. 
On cursory examination it would even 
appear to have experimental support. 
Binder (1955) among others, has 
pointed out that one or more of 
the characteristics of a stimulus may be 
discriminated in the absence of suf- 
ficient information of the remaining 
characteristics to permit specific identi- 
fication. Thus, at a brief exposure 
we may be able to discriminate the 
color of a stimulus without perceiving 
its texture or its shape, and, if we 
can discriminate the color of a stimulus 
without also perceiving the texture and 
shape necessary for identification, why 
couldn’t we be expected to discriminate 
the affective value of a stimulus in the 
absence of a specific identification? 
The difficulty with such a conclu- 
sion is this. Generally, the meaning 
of a stimulus is conveyed not by one 
or two physical characteristics but by 
the total aggregrate of characteristics. 
In the case of words, which are fre- 
quently used as stimuli in perceptual 
defense studies, the affective meaning 
is conveyed only by the perception of 
a complete word, not by isolated char- 
acteristics such as word length or that 
there is a tall letter at the beginning 
of the word. To discriminate the word 
shot from a vulgar four letter Anglo- 
Saxon word would require a very pre 
cise discrimination as to a subtle form 
difference in only one of the four 
letters. For discrimination of the al 
fective meaning of a threatening stim 
ulus to occur, some place in the nervous 
system the complete word would have 
to be identified. So, we find that at 
what first seems to be a plausible as- 
sumption of only very gross discrimina 
tion at levels below awareness, 15 Rie 
fact an assumption of very ame 
discrimination, provided it is tO 


OO 
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used to explain such phenomena as 
Perceptual defense. 


per imental Studies of Subliminal 
Discrimination 


What experimental evidence is there 
feels nmap for discrimination at 
ae Y that available to verbal 
method l ne can distinguish four basic 
join a ogical approaches to this prob- 
to oe a first consists of attempting 
Race Hon some nonyoluntary re- 
or oo as pupillary contractions 
ithe a stimulus presented with an 
termines that has been previously de- 
Dion) to be subliminal. The second 
similar is to condition a GSR or 
Stimul Ronvoluntary response to a 
then paa at supraliminal levels and 
Daa Kalra ee ti 
Method i for the stimulus. The third 
bined carn which is sometimes com- 
the Gsp the other two is to condition 
or Hae or some nonverbal response 
Paces prior conditioning of this 
resented to a stimulus that is then 
a a threshold values. The 
© det al response is then observed 
han Peg whether it shows greater 
ulus ee relationship with the stim- 
idhin, those trials when the verbal 
methodol, was incorrect. The fourth 
Teauey ie is to present a stimulus, 
Assuy atly a complex one, at a level 
log, wel to be subliminal and then to 
suj oF the effect of the stimulation in 


+ OS 
. dent behavior such as dreams, 
I shall 


of 


& Ss but instead select representative 

cra} Ments which illustrate the gen- 

Proh ‘ndings and the methodological 
ms encountered that limit the 
Pretation of the data. 


tbls s 
mi PE P E 
i nal Conditioning 
urne ‘ ae 
ditio iy Evidence for subliminal con- 
Ne ic 
S is for the most part negative 
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or at best controversial. One of the 
more intriguing studies is that of Baker 
(1938) who reported that the pupillary 
light reflex could be conditioned to a 
tone below the absolute threshold. 
Using the below threshold tone as a 
conditioned stimulus (CS) and change 
in illumination as the unconditioned 
stimulus (US), he was successful in 
establishing a conditioned response to 
the subthreshold tone within three 
trials. In addition, responses con- 
ditioned to the subthreshold sound 
generalized less than responses con- 
ditioned to suprathreshold values. They 
were also more stable and resistant to 
extinction. 

In a repetition of the study, Metzner 
and Baker (1939) obtained essentially 
the same results, but other investigators 
have been unsuccessful in attempts to 
duplicate this experiment (Hilgard, 
Miller, & Ohlson, 1941; Wedell, 
Taylor, & Skolnick, 1940). Wedell, 
Taylor, and Skolnick found no evidence 
of conditioning in any of the Ss they 
employed and a number of careful in- 
vestigators have been unable to obtain 
pupillary conditioning even to supra- 
threshold stimulus values (Hilgard, 
Dutton, & Helmick, 1949; Steckle, 
1936; Steckle & Renshaw, 1934; Stern, 
1948). 

In a recent experiment Taylor 
(1953) claims to have demonstrated 
subthreshold conditioning, but here 
again this interpretation of his results 
is open to serious question. He pre- 
sented six different geometric figures 
tachistoscopically one at a time to 
determine a level of exposure where 
the Ss were unable to identify at better 
than at chance level which specific geo- 
metric figure was being presented. He 
was successful in obtaining GSR con- 
ditioning at an exposure level where 
the Ss were unable to correctly identify 
the different geometric figures. 

While Taylor interprets his data in 
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terms of subthreshold discrimination, 
there is a more simple and plausible 
explanation of his findings. While his 
Ss may have been unable to identify 
which geometric form was being ex- 
posed, this in itself is insufficient evi- 
dence for assuming that all six geo- 
metric forms at this level of exposure 
resulted in the same perceptual ex- 
perience. It is more than likely that 
different perceptual experiences were 
associated with each of the six geo- 
metric forms at the level of exposure 
selected by Taylor. This is not to 
say that the information the S received 
was sufficient to specify what form was 
being shown, but merely that different 
experiences or perceptions were as- 
sociated with the different forms. For 
example, the exposure of a square 
might have led to a description on the 
part of the S of a perception of a dark 
smudge that seemed to sort of expand 
in the field. A free report of his 
perception following exposure of the 
triangle might have led to some de- 
scription such as ”I had the impression 
of sort of a slanted line moving across 
the field.” While these perceptions 
would not be sufficient to enable the S 
to label which figure was being pre- 
sented, they nonetheless are distin- 
guishable perceptions and therefore 
capable of differential conditioning or 
even differential verbal response pro- 
vided proper feedback is given. 

In Taylor’s experiment a necessary 
control would have been to determine 
whether the Ss reported the tachisto- 
exposure of a square and a 
triangle as identical perceptual ex- 
periences or as different. By asking 
the S to say “same” or different’ 
cessive exposures of geo- 
one could have deter- 
fferential perceptual 
at the verbally 


scopic 


following suc 
metric figures, 
mined whether di 
experiences existed 
distinguishable level. 


The possibility of subliminal con- 
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ditioning of finger retraction was 
thoroughly and carefully explored by 
Newhall and Sears (1933). Using 
electric shock as the US and a light 
as the CS, these investigators were 
successful in obtaining conditioned 
finger retraction to light intensities at 
the absolute threshold (defined as the 
50% point using the method of con- 
stant stimuli). Threshold determina- 
tions were also obtained throughout 
the experiment. Some of their Ss 
were found to give conditioned finger 
retractions to light intensities below the 
absolute threshold (four of their seven 
Ss gave a total of five conditioned 
finger retractions in the sigma intervals 
of — .5 to — 1.0). 

Newhall and Sears were properly 
cautious in interpreting their data in 
terms of subliminal conditioning. They 
pointed out that while some conditioned 
finger retractions were obtained at 
values below the absolute threshold, the 
threshold in their study was defined 
as the 50% point. Thus by definition 
of the threshold, the Ss verbally would 
report on a number of occasions having 
seen the light at values that actually 
fall below the arbitrarily defined 
threshold. 

Further confirmation that there was 
no super discrimination by unaware 
process in their experiment is found 
when the conditioned response is itself 
used to estimate the absolute threshold 
for light. By using the latter sessions 
in the experiment when conditioning 
had been well established in most $S» 
it was possible to estimate the thresh- 
olds from the frequency of conditioned 
finger retractions. It is interesting 
that the absolute threshold computed 
from the frequency of conditioned re- 
sponses corresponds almost exactly tO 
the value obtained by using the verba 
judgments. 

i The most recent and perhaps mg 
most thorough investigation of st 
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liminal conditioning appears to be that 
of Wilcott (1953). He attempted to 
condition GSR to subthreshold auditory 
pens at several different frequency 
yels. His Ss were given preliminary 
training on threshold determination and 
Ee stimulus intensity selected for sub- 
ed conditioning was the highest 
aaah that the S failed to report 
Be ee during the threshold determina- 
i ae, W hile he was successful 
ter R conditioning to suprathreshold 
Ronditio’’ no evidence was found for 
subth ioned GSRs in response to the 
an teshold intensities at any fre- 
me, level. Interestingly enough, 
Perot experiment appears to be the 
ie one using a subliminal CS that 
Sot e a low enough intensity so as 
ies o yield better than chance verbal 
Ports, 
Poom this brief survey of the litera- 
au can see that the evidence for 
ee at levels below those 
EAN to awareness is not very 1m- 
have ve. In the cases where studies 
i o tained positive results, other 
Mopars have been unable to dupli- 
Slight he findings. In the cases where 
ditioni evidence for subthreshold con- 
Seem on has been obtained the results 
ee o be more readily accountable by 
„T explanations. 
that here are several sources of artifact 
Bigs, to be carefully considered 
Perime interpreting the data of an ex- 
iscriy nt i terms of conditioning Or 
2 aia at levels below those 
Portan £0 awareness. Of major im- 
Tt „2ce is the definition of a threshold. 
test be borne in mind that our 


e see 
Sholds are arbitrary statistical defi- 


In the typical psychophysical 
an S$ is asked to report “yes” 
Absence concerning the presence Or 
the So, of the stimulus. Commonly 
*Pteq © accuracy level has been ac- 
hoy, mee a threshold, It is to be noted, 

t, that this procedure insures 


Or « 
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that the § is discriminating better than 
chance at a stimulus intensity below the 
arbitrary threshold designation. If a 
blank stimulus interval is included in 
the threshold determination, the S’s 
false alarm rate or his tendency to say 
“yes” in the absence of any stimulation 
can be observed. In a good S this 
false alarm rate may be 10% or less. 
Thus, any deviation from this 10% 
point in terms of positive answers or 
reports that he perceives the stimulus 
indicates he is now beginning to per- 
ceive at better than a chance level. At 
a stimulus intensity that the S reports 
perceiving 25% of the time, we are 
below his arbitrarily defined threshold 
value but at the same time we are 
15% above his chance performance 
level, To demonstrate that a condi- 
tioned response can be obtained 15% of 
the time at this stimulus intensity level 
is hardly evidence for concluding dis- 
crimination below verbal awareness. 

Further sources of error in the inter- 
pretation of subliminal conditioning 
studies revolve around the method used 
in obtaining thresholds and the instruc- 
tions and training given the Ss. We 
shall not go into these problems in 
detail here since the interested reader 
can consult the excellent discussion of 
these problems by Goldiamond (1958). 
However, we may note that the 
“threshold” obtained for an S depends 
upon a number of variables. Most im- 
portant are: (a) the instructions and 
expectations of the Ss; (b) the penal- 
ties for false alarms and for failure to 
detect a real stimulus; (c) the training 
given the S; (d) the use or lack of use 
of a ready signal; and (¢) whether or 
not a forced-choice method is employed 


(Blackwell, 1953). Most typically ex- 
periments on subliminal stimulation 


have given only cursory attention to 
the operations by which they have 
concluded that a given stimulus was 
“cybliminal.” Few, if any, would 
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modern psychophysics. 


Assessing Thresholds by Conditioned 
Responses 


A second method for investigating 
the question of a more sensitive dis- 
crimination that occurs at levels below 
awareness is to condition some response 
other than verbal to a stimulus and then 
observe whether this response gen- 
eralizes to values of the stimulus at 
threshold intensities. One could con- 
dition a GSR to a suprathreshold 
stimulus and then observe the fre- 
quency of conditioned responses as the 
stimulus was varied in intensity around 
the threshold level. The frequency of 
conditioned responses as a function of 

the intensity of the stimulus could be 

used to compute a threshold that could 
then be compared with the threshold 
obtained by verbal report. The ex- 
periment by Newhall and Sears de- 
scribed above provided an example of 
this method of threshold determination. 

One of the more interesting experi- 

ments of this type is reported by Davis 
(1950). Recordings were made of the 
action potentials in several different 
muscle groups in response to a sound 
presented above and below threshold. 
Increases in muscle tension occurred to 
stimulus intensities that the Ss did 
not report hearing. Unfortunately, 
these data can be taken as only sug- 
gestive since the method of threshold 
determination followed by Davis was 
not very rigorous. The Ss were not 
given preliminary training nor were 
they alerted for the intervals in which 
the stimulus might appear. In addition 
the instructions given were not of the 
type that generally yield a most sensi- 
tive verbal threshold. 

Even if replication should substan- 
tiate Davis’ finding that muscle tension 
varies at levels of stimulation below 
those capable of differential verbal 
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report, there is still the question of 
what meaning this has for behavior. 
For responses by some part of the 
body to external stimulation to be 
important psychologically it needs to 
be shown that these responses are 
capable of learning and serving as cues 
for behaviorally more important re- 
sponses. 

Recently Dulany and Eriksen (1959) 
reported an experiment in which the 
GSR threshold for brightness was sys- 
tematically compared with verbal re- 
sponse thresholds determined by two 
methods—a modification of Blackwell’s 
forced-choice as well as the more 
typical presence or absence report from 
the S. Ss were trained for 10 days 
prior to a conditioning session. The 
GSR was then conditioned to a supra- 
threshold intensity of the test light 
with electroshock to the ankle as the 
US. Following conditioning, 3 more 
daily sessions were employed to estab- 
lish the GSR threshold as well as a 
concurrent forced-choice discrimination 
threshold. To make the thresholds 
obtainable from the GSR and the verbal 
response directly comparable, the test 
patch could occur in either of two 
intervals designated by an auditory 
signal, Ss were required to report 
verbally which interval the light oc- 
curred in and their GSRs were con- 
currently recorded. If the GSR to 
the first interval was greater than the 
GSR to the second, this was considered 
the GSR’s choice on that particular 
trial. 

While these investigators were suc- 
cessful in obtaining stable GSR thresh- 
olds in their Ss, there was no indication . 
that the GSR gave a lower threshold 
than did either of the two verbal report 
methods. In fact, the verbal reports 
in most Ss were found to show @ 
deviation from the chance base line at 

3 . ae the 
lower stimulus intensity than © 
GSR’s. Also, it was noted that while 


>. 
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the verbal reports obtained an accuracy 
level of 100% at higher stimulus in- 
tensities, the GSR never achieved this 
high level of discrimination. This 
latter observation is important in that 
any attempt to claim greater sensitivity 
for the GSR would have to assume that 
Somehow a response which is less 
Sensitive or discriminating at above 
threshold intensities _metamorphoses 
on the stimulus range to become 
Nore sensitive at low stimulus inten- 
Sities, 
eA. be noted that the Dulany 
that T aks experiment does not prove 
fbe y s ISR is no more sensitive than 
ate ral report in obtaining a dis- 
sible =y threshold. It is quite pos- 
aat the many sources of error 
Re see in GSR recording whether 
of the aie equipment or to the nature 
‘air A response itself may prevent a 
iene T nn ot the two response 
hae =. Also, there is a possibility 
itsin on have had greater practice 1n 
o the od verbal responses as opposed 
Pinks SR for such a discrimination. 
he on at the present time, appear to 
5 a answerable possibilities but it is 
ing cn that the care taken in train- 
olds i Algae verbal response thresh- 
Xtengiy, experiment was much more 
periment than that employed in ex- 
indicator. that have used GSRs as an 
of unconscious perception. 


Th 
te Subception Effect 


tl There is a third class of experiments 
en S erroneously but nonetheless 
at evel nsidered to show discrimination 
k a below that available to aware- 
the Ks These experiments make use of 
tha eons experimental arrangement as 
Poach Ployed in the previous ap- 
Tespo, es. A nonvoluntary nonverbal 
to “Onse such as a GSR is conditioned 
Stim, ae suprathreshold stimulus or 
nteg 24d then the stimuli are pre- 
at about threshold value. The 


Ss are required to give a verbal re- 
sponse and the incidence of the con- 
current conditioned response is noted. 
However, in these experiments the 
conditioned response is not used to 
compute a threshold. Instead the data 
are analyzed in terms of whether or 
not a conditioned response occurs more 
often than chance would indicate in 
the absence of a correct verbal re- 
sponse. Lazarus and McCleary (1951) 
in their subception experiment were 
among the first to use this type of 
analysis as evidence for discrimination 
without awareness. 

They conditioned GSRs to 5 of 10 
nonsense syllables and then exposed 
the nonsense syllables individually in a 
tachistoscope at a duration level where 
verbal recognition was about 50% cor- 
rect, The average GSR to shock syl- 
lables was found to be greater than to 
nonshock syllables on those trials where 
the Ss were unable to give the correct 
verbal identification of the syllables. 
They interpreted their data in terms of 
subception: “a process by which some 
kind of discrimination is made when 
the S is unable to make a correct con- 
scious discrimination.” 

While nearly all of the discrimina- 
tion without awareness experiments 
that we have analyzed so far would 
have lent their data to this type of 
comparison, the novelty of the Lazarus 
and McCleary approach was the intro- 
duction of this method of analysis. It 
is not the same as determining a dif- 
ferential threshold for the GSR and 
the verbal response. Lazarus and Mc- 
Cleary did not compute separate thresh- 
olds. While they report that the 
tachistoscopic level was set so as to 
obtain approximately 50% correct ver- 
bal recongnitions, they do not report 
the differential accuracy of the GSR on 
classifying shock and nonshock non- 
sense syllables- Such information is 
important to our present discussion and 
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can be estimated from an unpublished 
replication of their study conducted in 
our laboratory. This replication was 
successful in duplicating the subception 
effect. But a comparison of the relative 
accuracy of the GSR and the verbal 
report shows that if an observer wished 
to predict whether a shock or a non- 
shock nonsense syllable’ had been pre- 
sented to the S and the observer could 
use only one of the S’s responses, the 
verbal report or the GSR, he would 
have been appreciably more accurate 
using the S’s verbal report. 

At the exposure level used in the 
replication approximately 45% of the 
verbal reports correctly identified the 
specific nonsense syllable. The most 
optimum weighting of GSR values 
gave approximately 60% correct clas- 
sifications into shock or nonshock cate- 
gories, but the chance expectancy for 
correct nonsense syllable identification 
for the verbal response is 10%, whereas 
for the dichotomous classification of 
shock vs. nonshock by the GSR, chance 
level is 50%. An observer attempting 
to determine whether the S had seen a 
shock or a nonshock syllable would 
have been correct in his predictions 
60% of the time using the GSR but 
using the verbal response he would 
have been correct 45% of the time 
plus one-half of 55% for a total of 
72.5%. (This assumes that the S would 
randomly distribute his errors on the 
verbal response between shock and 
nonshock syllables.) 

While the Lazarus and McCleary ex- 
periment does not demonstrate that the 
threshold of a GSR response is lower 
than the threshold for verbal identifica- 
tion, nonetheless, their finding that in 
the presence of an error mm the en 
response system the GSR response has 
better than chance accuracy has some 
important implications for a E 

of behavior without awareness. -++ se- 
where the present author (1956a) has 
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analyzed the subception experiment in 
terms of a multiple concurrent response 
model. In the Lazarus & McCleary 
experiment the S gave two concurrent 
responses to a stimulus—a verbal re- 
sponse anda GSR. The obtained sub- 
ception effect consists of a partial cor- 
relation between the stimulus and GSR 
when the verbal response is held con- 
stant. Conceived in this way it is 
possible to specify the general circum- 
stances under which such a partial 
correlation would be obtained. 

The necessary and sufficient condi- 
tions are fulfilled when both responses 
are independently correlated with the 
stimulus but less than perfectly, and 
less than perfectly correlated with each 
other. The independent correlation of 
the two responses with the stimulus 
means that both the verbal response 
and the GSR are responses to the 
stimulus as perceived rather than the 
GSR being a response to the verbal 
classification of the stimulus. Less 
than a perfect correlation between the 
verbal response and the GSR can be 
alternatively stated as saying that these 
two response systems have noncor- 
related error terms. Thus, variables 
which may affect the accuracy of use of 
verbal responses may not concurrently 
affect the accuracy of the GSR and 
vice versa. 

Formulation of the results of the 
subception experiment in terms of 
partial correlation enables us to state 
specifically just what the subception 
experiment demonstrated. It succeede 
in showing that both the verbal re- 
sponse and the GSR are each, at least 
to a degree, independently correlated 
with the stimulus and are partially 
independent of each other. In other 
words, the GSR is, at least partly, 
determined by the stimulus and me 
intervening perceptual process and Pa 
solely by the verbal response. This 


A 3 b 
relation can best he seen in terms 
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A. Stimulus — Perceptual Process — Verbal Response > GSR 


B. Stimulus — Perceptual Process > GSR — Verbal Response 


Verbal Response 
4 


r4 
C. Stimulus — Perceptual Process 


GSR 


Fic. 1. 


the diagrams in Figure 1. If the re- 
lation was as depicted in A, then it 
would have been impossible to have 
Obtained a subception effect. A partial 
correlation of the GSR with the stim- 
ulus holding constant the verbal re- 
Sponse could not have been obtained. 
Similarly if the relationship was as 
depicted in B, it would be impossible 
to find a significant correlation between 
ke verbal response and the stimulus 
Wen the GSR response was held 
constant, While the Lazarus and Mc- 
Seal data are not presented so as to 
oe whether or not a significant 
res ial correlation between the verbal 
he and the stimulus existed when 
eos was held constant, 1m the 
a ication of their experiment men- 
ned above a significant and apprecl- 
Bii correlation of this nature was 
ined, This would seem to rule out 
Thi relationship as diagramed in B. 
dia, leaves only the relationship as 
ican in C where we have depicted 
east SR and the verbal response as at 
both Partially parallel response ial 
Perea pendently responding to the 
Ptual process. ; 
he author has carried out a series 
€xperiments in his laboratory to 
an, ae partial correlation formulation 
a a see what implications it has for 
Ness cept of behavior without aware- 
Succe The subception effect has been 
Coney lly obtained in studying the 
enera t, GSR and verbal response 
Size 2 ization to a series of different 
tw. luares (Eriksen, 1956b). Here 
as found that when the verbal 


est 


response was held constant, a gen- 
eralization gradient for the GSR as a 
function of square size was still ob- 
tained. In succeeding experiments 
verbal responses together with a con- 
current lever movement response were 
studied (Eriksen, 1957, 1959). Again, 
if the instances where the verbal judg- 
ment or response was incorrect were 
analyzed the concurrent lever move- 
ment response showed better than 
chance accuracy. The effect also has 
been successfully obtained where the 
concurrent responses were an affective 
judgment of pleasant and unpleasant 
together with a specific verbal identifi- 
cation of tachistoscopically exposed 
stimuli (Eriksen, Azuma, & Hicks, 
1959). In all of these studies a sig- 
nificant partial correlation was obtained 
between either response and the stimuli 
with the other response held constant. 
In no case, however, was it found that 
the nonverbal judgment gave a lower 
threshold or more accurate discrimina- 
tion than did the verbal judgment. 
These studies were also successful in 
indicating several variables that con- 
stitute the sources of noncorrelated 
errors between two concurrent re- 
sponse systems. Thus, it was found 
that distracting signals_will introduce 
increased error into GSR responses 
without affecting the accuracy of con- 
current verbal responses and similarly 
the number of competing alternative 
verbal responses affects the error of 
aent without concurrently 


this judgn 
influencing the accuracy of the GSR 


(Eriksen, 1956b). 
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On the basis of the above evidence 
the subception effect as an effect is 
real enough but it has nothing to say 
and offers no proof concerning the 
capacity of the GSR to reflect a more 
sensitive discrimination than that which 
is available through the Ss’ verbaliza- 
tions. Instead the subception effect 
focuses attention upon the limitations 
of verbal responses in conveying the 
individual’s perceptual experiences. By 
formulating the effect in terms of 
partial correlation in a concurrent re- 
sponse system model, we have the 
advantage of being able to direct our 
attention upon sources of noncorrelated 
errors between two or more concurrent 
response systems. We have already 
indicated some of the factors that will 
lead to increase in noncorrelated errors 
between concurrent responses and it 
is interesting to speculate on the effect 
that variables such as differential re- 
ward and punishment might have upon 
decreasing the correlation between 
verbal and certain nonverbal responses. 

To say that the subception effect 
does not provide evidence for dis- 
crimination at levels below awareness is 
not to deny that the phenomenon is 
important to our general concept of 
awareness. There is reason to suspect 
that a considerable amount of the evi- 
dence that the clinician finds for un- 
conscious processes is in the nature of 
discrepancy between concurrent re- 
sponses. The clinician may note the 
Ss’ autonomic responses and evidences 
of emotion do not correspond, i.e., do 
not correlate very well, with his verbal 
statements. This lack of correlation 
is a basis for the clinician's inference 

cious processes or defensive 


of uncon Aa 
mechanisms at concurren 
Id seem the 


response system model wou ls 
efficient way of conceptualizing this 
situation and suggesting ways of = 
perimentally investigating the clinica 
phenomena. 


work. 
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Indirect Effects of Subliminal Stimula- 
tion upon Behavior 


A recent experiment by Dixon 
(1958) has introduced an interesting 
and novel method for demonstrating 
effects of stimulation below the aware- 
ness threshold. Dixon determined the 
brightness threshold for a vertical and a 
horizontal line by the ascending method 
of limits. Having obtained a threshold 
Dixon told his Ss that he was going to 
expose words to them at a brightness 
level where they would not recognize 
the words. He then exposed 10 words 
or phrases (6 were emotional in con- 
notation and 4 neutral) at a brightness 
level .3 log ft. lamberts below the 
threshold obtained for the vertical and 
horizontal lines. The Ss were told to 
give a word guess to each exposure 
even though they would not see the 
word. Each of the 10 words or phrases 
was exposed four times during a 
session and then at a later session the 
Ss were presented the list of stimulus 
words and their responses were read to 
them. They were instructed to try to 
match their responses with the stimulus 
words using any type of associative 
connection they wished. Dixon’s hy- 
pothesis was that the words exposed 
below threshold would have been un- 
consciously perceived in terms of their 
meaning and would thus influence the 
associations or responses the 5 made. 
Five of the seven Ss gave results in 
this predicted direction. In fact, when 
the individual probabilities for each S 
were summed through Ss the results 
were significant at a high level. 

Dixon’s approach to the study of 
discrimination without awareness is a! 
interesting departure in methodology 
but subsequent work indicates his re- 
sults are artifactual. Fuhrer and Eriksen 
(1960) in an extensive replication of 
the experiment confirmed that Ss could 
match associations given to stimu 


DISCRIMINATION AND I 


that had been exposed at illumina- 
hon levels too weak for correct identifi- 
cation, However, systematic employ- 
Ment of controls showed this effect to 
be a result of structural cues such as 
word or phrase length rather than 
unconscious perception of stimulus 
Meaning.” 

Fuhrer and Eriksen were unable to 
Obtain significant or suggestive results 
at brightness levels of stimulus ex- 
Posure below the threshold for vertical 
and horizontal lines obtained by the 
ascending method of limits or at the 
threshold obtained by a modification 
of Blackwell’s forced-choice technique. 
However, when exposure was at a 
brightness level where Ss could detect 
the Presence of a stimulus but not 
identify it, significantly better than 
chance matching occurred. That the 
ting by these Ss was due to struc- 
oe cues rather than the perception of 
Meaning was shown by the data’trom 
* control group. Using the same ex- 
Posure level but now exposing the 
q Mulus words and phrases upside 
— and backwards, the control Ss 

e also found to give significantly 
etter than chance matching of their 


w ciations to stimulus words. There 
aS no significant difference in per- 


langg between this control group 
his previous experimental group 
el had used the same exposure 
Plays: Since it seems extremely im- 
ion Sible that unconscious discrimina- 
is capable of perceiving the meaning 
Words that are flashed upside down 
tion backwards at very low Toe 
Perfo evels, it would appear that the 
tbs of both groups 1s at- 
outable to the cues perceived con- 
ping the length of words or phrases. 
ut to check upon this possibility a 
fren control was employed. „For 
Wo q Perimental group a control list of 
Mm ae Similar in structure but different 
aning to the actual stimulus words 


and 


fur 
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was constructed. Thus, for the stimu- 
lus phrase “male sex organs,” the 
structurally similar control phrase was 
“made six organs.” In the second 
session, when the experimental Ss were 
asked to match their associations to 
the stimuli, they were also presented 
with this control list of structurally 
similar words together with the ex- 
perimental words. Analysis of the re- 
sults revealed no tendency for this 
group to match their associations with 
the correct words over the structurally 
similar control words. 

The experimental work so far offers 
no convincing evidence that the human 
observer can respond differentially to 
stimulation that is of an intensity 
level below that at which a discrimi- 
nated verbal report can be obtained. 
In other words, when adequate con- 
trols are provided, no response has 
been found to yield better absolute dis- 
crimination than a verbal response. 
However, there are most likely circum- 
stances where a nonverbal response 
may be a better indicator than verbal- 
ization. No psychophysicist, for ex- 
ample, would expect an experimental 
arrangement where the S pressed a 
button directly in front of him to in- 
dicate “No” but had to get up and 
walk across the room to press a button 
to indicate “Yes” to yield as low a 
threshold as a situation in which both 
buttons were directly in front of the S. 

Such response biasing can occur in 
less obvious ways. It is quite possible 
that anxiety conditioned to certain re- 
sponses may bias their occurrence, 
Similarly, artificial restrictions on the 
number of verbal labels available can 
have a similar effect as well as delib- 
erate response suppression due to social 
taboos associated with the response. 
But it is to be noted that under these 
circumstances one would not conclude 
evidence for a super unconscious dis- 
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crimination of stimulus intensities too 
weak to elicit verbal discrimination. 


AWARENESS AND THE SUBJECT’S 
SUBJECTIVE CONFIDENCE 
There is one other experimental ap- 
proach to the problem of awareness 
that deserves brief consideration. It 
differs from the other methods in that 
a discriminated verbal report is not 
used as the criterion or definition of 
awareness. Instead awareness is de- 
fined in terms of the S’s statement as 
to whether or not he feels he has per- 
ceived the stimulus or whether he feels 
he is merely guessing. The evidence 
consists of showing that when the S$ 
expresses zero confidence, e.g., sub- 
jectively pure guessing, he nonetheless 
responds correctly to the presence or 
absence of the stimulus at better than 
chance level, 
Peirce and Jastrow (1885) were 
among the first to report this phe- 
nomenon. Working with the difference 
limen on weight judgments, Jastrow’s 
data indicate that in the presence of a 
subjective report of “no confidence,” 
his judgments of heavier or lighter 
were still correct significantly better 
than chance. Since then this phe- 
nomenon has been a frequent finding in 
a number of experiments. In fact, as 
Adams (1957) points out, it is a stable 
enough phenomenon to permit its use 
asaclassroom exercise. Representative 
of experiments obtaining results on this 
aspect of awareness are those of Fuller- 
ton and Cattell (1892), Miller (1939), 
Coover (1917), and Williams (1938). 
Although these experiments are fre- 
quently and uncritically used to Support 
contentions concerning discrimination 
without awareness where awareness 
is defined in terms of a discrimi- 
nated verbal report, it is to be no 
that these experiments do not fit bi 
definition. Here a discriminated ie 
report is obtained. The evidence for 
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unawareness is in terms of the S’s 
report of lack of confidence in his ver- 
bal judgments. While this subjective 
confidence in his reports is an interest- 
ing problem, it has no operational 
similarity to the type of experiments we 
have considered above. An equation of 
awareness with verbalization, by defi- 
nition, means that any verbal judgment 
is one made with awareness irrespective 
of whether the S has little or no con- 
fidence in the judgment. 

The major difficulty with experi- 
ments of this kind is that they place 
upon the individual § the responsibility 
for establishing the criterion of aware- 
ness. We have no way of objec- 
tively knowing when S A says a given 
judgment was a pure guess and S B 
makes a judgment and states it was a 
pure guess that the two Ss are talking 
about the same phenomena or that they 
are using the same criteria for guessing. 

In work in our laboratory we have 
found very marked individual dif- 
ferences for the intensity range be- 
tween where an S shows better than 
chance guessing in his responses and 
the point where he begins to report 
confidence in his judgments. For 
some Ss the stimulus intensity where 
responses are better than chance almost 
coincides with their confidence thresh- 
old. For other Ss there is a great 
range between where guesses are above 
the chance level and the point where 
confidence is reported in these judg- 
ments. While research on this partic- 
ular aspect of awareness may be indeed 
fruitful, nonetheless, at the present 
stage of our knowledge we are not 
justified in lumping together data from 
these experiments with those that have 
been discussed in the preceding sec- 
tions. Not only are they operationally 
dissimilar but more importantly, these 
marked individual differences indica 
that as long as we rely upon the 
himself to report whether or not 4 
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verbal judgment was made with aware- 
ness, we are in fact using as many 
different criteria of awareness as we 
have experimental Ss. 


BEHAVIOR WITHOUT AWARENESS TO 
Anove THRESHOLD CUES 


While we have been unable to find 
evidence for a supersensitive discrimi- 
nating unconscious, the evidence that 
behavior can be directed by above 
threshold cues of which the S is un- 
aware, is not only more plausible but 
Somewhat more substantial. Common 
Sense tells us that we are constantly 
utilizing cues of which we are unaware 
n our perception of depth, of shape 
and size constancy, as well as a num- 
er of cues that guide such complex 
Motor habits as piano playing and 
eae automobiles. One of the factors 

aking the elimination of such habits 
i fingernail biting or other manner- 

ms hard to eliminate is that they tend 
or without our awareness. Ex- 
not ments appear to have demonstrated 
nae the use of cues without aware- 
A Dut also the lack of awareness 

Some instances for the response the 
Sone making, for the reinforcements 
cote the response and in me 
Wee the contingent relationship be- 
m n cues, responses, and reinforce- 

ents, 
ay of the experiments providing 

ence on the effects of cues with- 
awareness were considered by 
tas as demonstrating effects of 

iminal stimulation upon behavior. 
in. Will be seen the use of subliminal 
grag nection with these cues was 
ee since the particular api 
the X methodology did not con air 
Permig ay controls or op R 
aS sul ae classification 0 these = 
Of 5 o imina]. In fact, on the s 
to €r evidence there is every reason 
feo: SPect that these cues, when ef- 

® were above a chance discrimi- 


the 


nation level for verbal report when 
assessed by proper psychophysical 
techniques. 

The methodology employed in ex- 
periments in this area makes use of one 
or another of a few basic techniques. 
In most cases the cue is presented 
without forwarning the S that it will be 
present. Typically it is presented at 
an intensity so low as not to command 
attention. In other experimental ar- 
rangements the cue may be of a more 
obvious nature but the S is prevented 
from attending it by misdirection as to 
the purpose of the experiment or mis- 
direction concerning the nature of the 
experimental task. In fact, misdirec- 
tion of attention seems to be a crucial 
consideration in experiments of this 
nature, suggesting that perhaps we are 
dealing more with a concept of atten- 
tion than one of awareness, if indeed 
the two concepts can be distinguished. 

One may distinguish two separate 
lines of investigation using incidental 
cues. The first is concerned with the 
effect of incidental cues upon behavior 
that was learned outside of the experi- 
mental situation. For example, the 
effect of the incidental stimulation of 
the word “angry” upon the inter- 
pretation of ambiguous profiles might 
be studied? The other approach is 
concerned with the problem of learning 
without awareness. The first approach 
we will term performance without 
awareness to distinguish it from the 
problem of learning without awareness. 

Besides sharing the common use of 
the technique of incidental stimulation 
both lines of research investigation also 
share a common problem in the defi- 


nition of awareness. Both must rely 
upon the interrogation of the S con- 
cerning his awareness in the experi- 
mental situation. As was pointed out 


2T am indebted to George S. Klein for 
the term “incidental stimulation.” 
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earlier in this paper such interrogation 
is fraught with difficulties. The value 
of the concept of awareness depends 
upon the adequacy with which the 
E surmounts these problems. The 
operational meaning of awareness will 
vary markedly from one experiment 
to the next depending upon the ade- 
quacy of the E’s assessment of the 
Ss’ verbalizations and the precautions 
taken to insure that they are valid 
and meaningful. 


Performance without Awareness 


‘There have been several experiments 
demonstrating that unaware cues are 
capable of producing perceptual illu- 
sions. Of the series of studies utilizing 
the Mueller-Lyer Illusion (Dunlap, 
1900; Titchner & Pyle, 1907), the one 
by Bressler (1931) appears to be the 
most thorough and best controlled. He 
found evidence of illusory effects in the 
judgment of the lengths of the two lines 
when the directional arrows were so 
faint that the Ss reported that they 
were unaware of their presence. 

Miller (1939) and Perky (1910) 
projected geometric forms on the back 
of a ground glass screen at low inten- 
sities and found that the patterns pre- 
sented were influential in determining 
the Ss’ choice of geometric pattern. 
The effect was found even when the Ss 
were unaware that a pattern was actu- 
ally being projected on the screen. 

More recently investigators have 
made use of the metacontrast (Alpern, 
1952) phenomenon as a method of pre- 
senting cues in such a way that the $ 
would be unaware of them. Smith and 
Henriksson (1955) presented a stimu- 
lus consisting of fan shape lines, very 
briefly in a tachistoscope, and then, 

almost immediately after, flashed k 
i timulus. While their Ss did 
square stimu L haned. tines 
not report seeing the fan shat ee 
they did tend to see the square as 
trapezoid, Kolers (1957) has pre- 
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sented tachistoscopically sets of designs 
that required abstraction or conceptu- 
alization and found that if cues to the 
solution of the problems were flashed 
first, these cues were capable of in- 
fluencing the type and direction of the 
solution even though Ss reported they 
were unaware that the cues had been 
presented. 

Using a modification of this meta- 
contrast technique, Smith, Spence, and 
Klein (1959) presented a line drawing 
of an expressionless human face to 
their Ss. The face was continuously 
presented to the Ss except for brief 
interruptions where the word happy or 
angry was briefly flashed. They re- 
port suggestive evidence that the Ss’ 
interpretations of the face were more 
pleasant following exposures of the word 
happy than following exposures of the 
word angry (12 of 20 Ss mentioned 
more pleasant characteristics of the face 
following exposures of the word happy 
than they did following exposures 0 
the word angry), Exposures of the 
word angry however, had little or no 
effect upon the Ss’ interpretations of 
the face. 

It is to be noted that the experiments 
described above in this section have 
been interpreted as demonstrating the 
effects of subliminal cues upon per- 
ception. If by subliminal we mean that 
the intensity of the cues was too weak 
to elicit a verbal report threshold under 
careful psychophysical conditions, then 
none of the above experiments permit 
interpretation in terms of subliminal 
stimulation. In Bressler’s experiment 
even the faintest lines he employed, 
when studied independently of the il- 
lusion, were found to yield better than 
chance discrimination by his $s- 
Vinacke’s (1942) data clearly show 
that the Miller and Perky results are 
not- obtainable when the geometric 
forms are presented at an intensity 
level below a rather crudely determine 
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Psychophysical threshold. Similarly the 
Metacontrast effect seems to depend 
ot the first stimulus being at a level 
a rg have been detectable were 
n out?” by the second 
a a se And in the Smith et al. 
Bow: o servations upon a control 
fe Be Ss who had been alerted to 
at act that words were being flashed 
‘cated that most of the word ex- 
= for the experimental group 
; eae the level for detectability 
word. No evidence is presented 


to show 2 
teas that the presentation of the 
lad happy and angry had a sig- 


of en effect upon the interpretation 
tions igs when presented at dura- 
den ow the detectability level. 

Consider When the above studies are 
dental red as demonstrations of inci- 
ion ee than subliminal stimula- 
With au conclusions must be accepted 
are ao In most cases, the effects 
liable Sht even though statistically re- 
Studies , Careful replication of these 
be SS 'S needed before confidence can 
And then, 

hg replication substantiates the 
Š careful parametric analyses 
; be undertaken to explore not 

i tom Parameters of the experimental 
the <0 that create conditions where 
the inchs inattentive to or unaware of 
Stiented wt stimulus but also studies 
Steg rindi studying the complexity of 
S can be utilized in this manner. 
Stin G effects of this incidental 
the a Dn should be compared with 
actua, StS when the stimulation 1S 
T y 


that lere 


Jeca 5 
Meser ause of their particular nature 
a 5 Y . . ni d 
Te the SPecial consideration. These 


b are 

rectal; XPeriments based upon the 
motte €nomena. In 1917 Poetzl 
tions: „a series of studies on his 


s 
m which he exposed complex 


pictures tachistoscopically and in- 
structed his patients to dream about 
the picture. He found that items in the 
picture which were not reported in 
intentional recall tended to occur in the 
dreams. Interest in this phenomenon 
was revived briefly in the early 1930's 
(Malamud & Linder, 1931) and then 
lay dormant until recently revived by 
Fisher (1954). An experimental re- 
port by Shevrin and Luborsky (1958) 
illustrates the methodology of research 
on this phenomenon as well as the gen- 
eral findings and methodological prob- 
lems encountered. These investigators 
tachistoscopically presented a complex 
scene to their Ss following which they 
asked for a description and drawing of 
what the S had seen. Ss were then 
asked to bring in their dreams in a 
subsequent session. In this subsequent 
session further intentional recalls were 
obtained and the objects obtained in 
intentional recall were compared with 
the objects that appeared in the Ss’ 
dreams. Since many of the Ss did not 
report dreams, they were asked to 
conjure up a mental picture or image 
which they were then asked to draw 
and to describe. 

Shevrin and Luborsky report that a 
significant number of items and char- 
acteristics of the tachistoscopically ex- 
posed scene that were not mentioned 
in the intentional recalls occurred in 
the dreams or the S’s images. These 
findings were interpreted in terms of 
the psychoanalytical concept of day 
residues. 

The complexity of these studies and 
experimental arrangements poses many 
methodological problems and gives rise 
to the possibility of numerous artifacts. 
One of the first problems encountered 
is the frequency or base rate with which 
different objects or items spontaneously 
occur in dreams and images independ- 
ent of a prior pictorial stimulation, 
This is a particularly acute problem 
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since these investigators used a rather 
loose criterion for determining whether 
objects that had appeared in the picture 
occurred in the dreams or images. 
Thus, if the picture contained trucks 
and the S in his dream or image 
mentioned a bicycle, this was scored 
as an additional memory recovery since 
both represented a means of conveyance. 

A second problem involved in the 
interpretation of Poetzl phenomenon 
data concerns the previously mentioned 
problem of the S's’ subjective confidence 
level. When instructed to recall or 
describe what had been seen in the 
tachistoscopically presented picture we 
may assume that the S has a cer- 
tain veridical orientation. That is, he 
wishes to be correct. As we have 
previously seen, Ss have different 
criteria for reporting what they believe 
to have existed. Some Ss may not 
mention an item unless they have fairly 
high confidence that it was present 
while others may use a lower confidence 
level. However, when freed of the 
veridical orientation required by in- 
tentional recall and asked to image, 
most Ss would be expected to have 
freer criteria for what they would re- 
port. Thus, items which they were 
not certain were in the picture or for 
which they had low confidence are 
more apt to appear in the image than 
under instructions for intentional 
recall. 

A third problem is what we might 
call the contingent probability of items 
going together in our enviornment. To 
illustrate what I mean by this, let us 
consider an S who has no recall of 
een a boat or a wharf in a 
he does describe a body of 
s deliberate recall. When 
age, It seems 


having s 
picture but 
water in hi c 
he is now asked to 1m 
likely that his associations are apt to 
be conditioned by the thought of water. 
Thinking of water it 1s likely that as- 
sociations might run toward boats and 
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wharfs and other objects that have in 
his past experience tended to be as- 
sociated with a body of water. Since 
these objects would also have a fair 
probability of having actually been in 
the picture, this would lead to an in- 
crease in the recalled items in the image 
condition which was spurious due to 
the past learned associations among 
objects in our environment. In sub- 
sequent work on this problem a satis- 
factory control for this possibility might 
be obtained by having control Ss who 
do imaging after observing the ex- 
perimental Ss’ drawings and descrip- 
tions of the picture. Another control 
possibility would involve the use of 
stimulus pictures composed of hetero- 
geneous objects whose inclusion was 
determined by random numbers. 


Learning without Awareness 


On the basis of the large number of 
studies during the past few years on 
learning without awareness, it would 
seem that the phenomenon itself should 
be well established by now. No at- 
tempt will be made here to survey 
the large literature. This has been 
well done by others (Adams, 1957; 
Krasner, 1958). Rather, this research 
area is mentioned in order to show 
that the phenomenon is not as well 
established as is generally supposed and 
also to indicate some of the directions 
that future research in this area must 
take. 

As was also the case with research 
on performance without awareness: 
learning without awareness studies 
have been plagued with the problem O 
adequate operational criteria for aware 
ness. Many of the experiments 1” 
this area have been concerned primarily 
with operant conditioning and little 0" 
no concern has been given to the defi- 
nition of awareness. In many case 
the only criterion is the statemen? 
by the Æ that the Ss were 


-> 


DISCRIMINATION AND LEARNING WITHOUT AWARENESS 


aware or that the Ss were unable to 
ee the correct response class. A 
ton aan have focused primarily 
Ere a phenomenon of awareness and 
Ne evidence for learning without 
Atenes has not been impressive 
Poeding Eriksen, 1960). Adams 
eee m his survey has pointed out 
a as the inadequacies in this field 
Hon en such as the lack of replica- 
ERN experiments, and more partic- 
Fle y the failure of Es to adequately 
Out the possibility of correlated 
ypotheses, 

os recently Dulany (in press) 
Ferhat led into question many of the 
that S conditioning studies by showing 
irla results could be attributable to 
o ated hypotheses and the transfer 
Drp ausy acquired verbal habits. 
typical, used what has become a 
studi verbal conditioning procedure, 
‘gig the influence of the E's 
age n hmmm” upon the S's plural 
incres output. He found a significant 
Š a in plural noun output by his 
o i though none of them were able 
Sory rbalize the correct response cate- 
unawa In this sense, then, they were 
ata ES But Dulany analyzed the 
at rom his Ss further. He found 
in the only Ss who showed a learn- 
o €ffect were those who were able 
Bs Verbalize to the effect that the 
tha, VMM hmmm” indicated to them 
a ned were associating in the 
Strate category. He further demon: 
tend 4 that associations to plural nouns 
associa P other plural nouns whereas 
this Aeons to singulars do not show 
Dreti o dency, Since the Ss were inter- 
dica “8 the “mmm hmmm” as an 1m- 
ing -ON they were to continue associat- 
h mn that category and the “mmm 
m? followed a plural, this effect 
learnin be sufficient to account for the 

ù Sea. shown by his Ss. 
Sss ag rch on learning without aware- 
well as unconscious phenomena 
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in general has been characterized by 
uncritical acceptance. Diven’s (1937) 
experiment on unconscious condition- 
ing of anxiety has been extensively 
cited and used as the basis for con- 
siderable theorizing concerning psycho- 
dynamics. Yet as Lacey and Smith 
(1954) have so clearly pointed out the 
Diven experiment did not contain the 
basic data comparisons necessary for 
a logical conclusion of conditioning 
without awareness. Using a more 
sophisticated design, Lacey and Smith, 
however, reported that they obtained 
evidence of unconscious conditioning 
of anxiety. But here again, Chatterjee 
and Eriksen (1960) have shown that 
Lacey and Smith’s evidence on con- 
ditioning may well have been due to an 
artifact arising from the method of 
computing ‘conditioning scores. These 
latter investigators were in general 
successful in replicating the Lacey and 
Smith findings but due to a more care- 
ful assessment of the Ss’ verbalizable 
awareness, they were able to show that 
the autonomic conditioning was no 
more precise nor specific than the Ss’ 
verbalizations. 

Perhaps our tendency to uncritically 
accept experiments on unconscious 
phenomena may be due to our firm 
belief in the existence of the uncon- 
scious. But it is to be noted that 
while we might firmly believe in 
the phenomena of classical condition- 
ing this does not discharge an E 
from the duty of showing that his 
particular conditioning results are not 
attributable to the artifact of “pseudo 
conditioning.” There would seem to 
be little doubt that a considerable 
amount of human behavior occurs with- 
out awareness of the behavior at the 
time of its occurrence but it is to be 
noted that this does not logically re- 
quire that behavior 1s learned with- 
out awareness. It may well be that 
the awareness process is biologically 
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adapted to be sensitive to features in 
the environment that are important to 
the organism and that any circum- 
stances important enough to the organ- 
ism to produce learning would not go 
undetected by awareness. This is not 
the same as saying that awareness is 
necessary for learning. Rather, it sug- 
gests that any situation where the cues 
and the reinforcements are salient 
enough to produce learning, will not 
escape detection by awareness. 

It would seem that on the basis of 
the available evidence learning without 
awareness in human Ss is not ade- 
quately proven. A considerable re- 
search effort is needed not only to ar- 
rive at acceptable operational criteria 
of awareness but to thoroughly explore 
the different types of learning situations 
to determine what if any kind of learn- 
ing can take place without awareness. 
It may be that only simple conditioned 
responses are capable of learning with- 
out awareness or it may be that fairly 
high level concepts can be acquired. 
More attention in research needs to be 
paid to the awareness process itself, 
particularly in terms of criteria and 
independent correlates of awareness. 
Also, systematic parametric analyses of 
the experimental situation and arrange- 
ments where learning without aware- 
ness occurs must be undertaken. These 
latter studies should go far in sug- 
gesting an improved concept of aware- 
ness or attention. 


SUMMARY AND CONCLUSIONS 


This paper has surveyed the ex- 
perimental work on discrimination, per- 
formance, and learning without aware- 
ness. The methodology employed in 
these experiments has been critically 
examined and the nature of the evi- 
dence evaluated. Some of the prob- 
lems inherent in the definition of 


awareness have been considered in re- 
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lation to the experimental work and 
an attempt has been made to show that 
there are a number of separate ques- 
tions that have traditionally been 
lumped together under “unconscious” 
phenomena. Artifacts involved in ex- 
periments on subliminal perception and 
subception were pointed out and alter- 
native interpretations for these ex- 
periments were suggested. 

It was concluded that at present 
there is no convincing evidence that the 
human organism can discriminate of 
differentially respond to external stim- 
uli that are at an intensity level too low 
to elicit a discriminated verbal report. 
In other words, a verbal report is as 
sensitive an indicator of perception as 
any other response that has been 
studied. 

Consideration of the clinical evidence 
for the concept of the “unconscious 
suggests that an assumption of “sub- 
liminal discrimination” is not required 
to account for the observed phenomena 
upon whichthe concept is based. Instead, 
the evidence noted by the clinician O" 
student of personality would seem tO 
be explainable in terms of either thé 
demonstrable partial independence ° 
different response systems to fragmen- 
tary perceptual cues and memory traces 
and/or to lack of awareness to above 
threshold cues. 

A science of personality is not fur- 
thered by the frequent tendency ° 
psychologists to discuss the “uncon” 
scious” with all the ambiguity aR 
reverence that religions accord to t" ; 
soul. There is great need to spell mi 
explicitly the assumed characterist!© 
of the unconscious and to search ne 
explanations of so called unconscion? 
phenomena in terms of more camo 
place psychological variables. To A 
so may destroy the titillating myste t 
that the unconscious seems to hold 
then that is the business of science. 
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poy sophisticated observers are 
Bea that a wide gap exists between 
ee ad theory and the techniques 
ality ocedures used to measure person- 
pe variables; The low level of pre- 
Rah n of such testing procedures may 
the a a function of the failure to apply 
cn leories themselves to the methods 
ph ap Particularly, it is a 
mina € to apply an analysis of the deter- 
RN of behavior in general to the 
. ¢ test taking behavior of the sub- 

Jects (Ss), 
mee gap itself may be described as 
relates three aspects. The first of these 
used S to the question of the constructs 
Whiel in the theory and the constructs 
ure h the tests were developed to meas- 
dey; _In many instances rather than 
ising tests which measure specific 
“eae constructs which are care- 
ie Bees and for which the test 
refer tor can be understood as a logical 
o ie the descriptive constructs used 
e Haig test response do not logically 
Dut a to the new theoretical constructs 
See bent or twisted to measure the 
ch a ables. That is, test constructs 
Spons were used to classify test re- 
läte” 5 developed earlier are trans- 
iable to be measures of the new var 
Bret Examples of this are use of 
aenach variables such as color, 
fr ament; and shading which arose 
Such ager type theory, to measure 
Bidity Onstructs as “ego strength,” “riz 

” and “tolerance for ambiguity. 


al x i 
hig Rete indebted to Shephard Liverant for 
this ee comments and suggestions about 


The Rorschach was not developed to 
assess such variables and in translating 
or twisting older methods of Rorschach 
scoring to measure these variables it is 
quite likely that a loss of prediction 
results. 

A second aspect of this gap between 
personality theory and methods of 
measurement of personality lies in the 
testing procedure itself. For example, 
where the theory may emphasize the 
significance of differences in behavior 
in the presence of authority figures vs. 
peers or males vs. females, the formal 
test procedure assumes no such vari- 
ables are important. That is, no differ- 
ence in interpretation of test results 
follows from the fact that the examiner 
may have a different social stimulus 
value in one case than in another or 
under one set of conditions rather than 
another. In such an instance although 
the theory itself recognizes (and ex- 
perimental data such as Gibby, Miller, 
& Walker, 1953, and Lord, 1950, sup- 
port) the importance of the effect on 
behavior of the nature of the social 
stimulus, the test procedure itself does 
not take it into account. An example 
would be in the application of Murray's 
theory (1952) which sees behavior as 
a function of internal needs and envi- 
ronmental presses. Tests have heen de- 
veloped using this theory (Thematic 
Apperception Test, as clinically used ; 
Edwards Personal Preference Sched- 
ule) which presume to measure the 
strength of various needs but fail to 
account for the test behavior as a func- 
tion of the testing situation itself (an 
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environmental press) as one of the var- 
iables determining the test behavior. 
Other characteristics of this discrepancy 
between theory and test taking proce- 
dure will be discussed more fully later. 
A third aspect of the gap lies in the 
area of inference from test behavior. 
The issue here is that there is an ab- 
sence of logic or contradiction in the 
assumed relationship between what the 
S does, or test behavior, and what is 
inferred from such behavior. Peak 
(1953) and Butler (1954) among oth- 
ers have discussed this problem earlier. 
Jessor and Hammond (1957) have 
noted such a gap in the inferences made 
from the Taylor Anxiety Scale. An- 
other example could be drawn from the 
Edwards Personal Preference Schedule 
(Edwards, 1953) in which Ss are 
asked to state their preferences for dif- 
ferent kinds of goals but there is no 
theoretical basis provided to allow one 
to make predictions about nontest be- 
havior from such preferences. Of 
course, it can be assumed that the pref- 
erences have some one-to-one relation- 
ship with some criterion behavior, but 
it is unlikely that even the test authors 
would make such a theoretical commit- 
ment. In other words, it is not clear 
exactly what can be predicted or should 
be predicted from the test responses. 
Individuals using such tests, however, 
can defend themselves by stating that 
prediction is after all an empirical mat- 
ter and one has to find out what can 
or should be predicted. It is likely, 
however, that the construction of tests 
which are systematically or theoretically 
pure, in that they are devised to meas- 
ure specific variables or to make specific 
predictions, with the method of meas- 
urement and inference a with 
the theory will ultimately pravide maeh 
better predictions of behavior as ya as 
a test of the utility of the theory itsel, 
The purpose of this paper 15 nee 
plicate some of the implications OF 4 
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social learning theory of personality for 
the measurement of personality vari- 
ables. The particular point of emphasis 
is the measurement of goal directed be- 
havior conceptualized in social learning 
terms as need potential. Secondarily, 
the paper aims at illustrating the nature 
of the relationship between testing pro- 
cedures and inference about behavior 
more generally. 

In social learning theory (Rotter, 
1954) the basic formula for the predic- 
tion of goal directed behavior is as 
given below: 


BPos,.n, = Mirna & RVen) C1 


The formula may be read as follows: 
The potential for behavior . to occur 
in Situation 1 in relation to Reinforce 
ment a is a function of the expectancy 
of the occurrence of Reinforcement @ 
following Behavior x in Situation 1 and 
the value of Reinforcement a, in Situa- 
tion 1. Such a formula, however, 1$ 
extremely limited in application for it 
deals only with the potential for a give? 
behavior to occur in relationship to * 
single specific reinforcement. The pre 
diction of responses from personality 
tests requires a more generalized con” 
cept of behavior and the formula fon 
these broader concepts is given below: 
BP(s—n) 56, 


=n) Rean) 
= F(E -nisan Rai) 


& Riain [2] 


This may be read: The potentiality al 
the functionally related Behaviors ¥ i 
n to occur in the specified Situations a 
to n in relation to potential Reinforce” 
ments a to n is a function of the expect 
ancies of these behaviors leading 
these reinforcements in these situat! 
and the values of these reinforcemen s 
in these situations. For purpose 
simplicity of communication, tlie t^ en 
basic terms in this formula have he 
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ppically referred to as need potential, 
ae r 

"aie of movement, and need value 
as in the third formula below : 


NP = {(FM & NV) [3] 


$ — formula the fourth concept, that 
plicit, pat situation, 1s im- 
ee ne variables referred to above 
Se eeu for measurement have 
freon shed and further explicated ina 
ous publication (Rotter, 1954). 
hi ae to illustrate the social learn- 
trent heory implications for measure- 
ie personality and for the meas- 
onthe goal directed behavior, it 
appro, expedient to consider three basic 
on to this problem based on the 
icall vet of determinants used theoret- 
ly to predict such goal directed be- 
a and the problems, limitations, 
advantages of each approach. 


Stren 
TRENGTH oF NEED AS A BASIS FOR 
PREDICTING BEHAVIOR 


e tong many esoteric systems of 
of ye utilize essentially the strength 
metho, , drive, or instinct approach, this 
or — can be described as the simplest 
a tapi sci approach, Basically 
ike of constructs are formulated 
emnin on an a priori basis than 
c ere or at least on a presumed 

haga rather than experimental basis. 
instin descriptive terms may refer to 
OF ye a drives, needs, factors, entities, 
Sota Pad of the mind (i.e. the Minne- 
dwa ultiphasic Personality Inventory, 
ule ards Personal Preference Sched- 
ete Rorschach, Humm-Wadsworth, 
there ; They all have m common that 
istic be more than one basic character- 
istic, hat these two or more character- 

lenis in some way measurable — 
Vidual’s han and presumably the i i- 
he Gane behavior can be predicted rom 
i aracteristics which are “stronger 


and : 
“ the which are 


i characteristics 
W aker 2 ara 


Sometimes the personality disposition 
can be predicted from the strength of 
other constructs according to either 
simple or complex statements of rela- 
tionship formally postulated, hypothe- 
sized, or informally asserted. These 
relationships can become quite complex 
as in psychoanalysis or quite esoteric as 
in Szondi’s explanation that motivated 
behavior is a result of the interaction of 
dominant and recessive genes. Because 
the methods of measurement in some 
instances cannot be direct, as in the 
assessment of unconscious drives in 
psychoanalysis, an impression of great 
complexity 


y is given but regarded en- 
tirely from the point of view of the 
prediction of behavior, the system may 


still have a simple character. The po- 
tential for a given kind of behavior 
is still directly predictable from the 
strength of the drive, instincts, needs, 
or energies postulated. 

There is another form of this model 
in which the various drives, disposi- 
tions, or needs are considered to inter- 
act, For example, the individual may 
be conceived of as being controlled by 
his intellect and his emotions, but his 
behavior must be understood in light of 
the interactions of these two forces with 
a third variable, the will as in the Ror- 
schach Test (1942). Again, this makes 
complex the calculation of strength or 
weakness but does not change the over- 
all method of making predictions. 
Whether dealing with will, intellect and 
emotions or ego, superego and id, the 
effect of interaction is only to increase 
or decrease the tendency of one of the 
needs to function or to strengthen or 
weaken one of them or perhaps to pro- 
duce a fourth or fifth additional need. 
The basic method of prediction stays 
the same although the calculation of 
strength or weakness in such a model 
becomes more difficult. 

The obvious problem, of course, for 


such rudimentary method of prediction 
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3 redict anything at all. Ifa 
is how to ee ates or needs 
system includec mete 
and these are ordered on some r 
system from high to low, does one as- 
sume that the person will always act 
in the fashion to be predicted from his 
strongest or highest need? If an S is 
more oral than anal, does he always act 
in an oral fashion? Actually the most 
logical assumption in regard to any spe- 
cific instance is that he will always act 
the same way. One might presume on 
a statistical basis, as it is sometimes 
done, if the individual js at the 70th 
percentile on Need A and at the 30th 
percentile on Need B, 70% of the 
time he would act in one fashion and 
30% of the time in the other over some 
undefined period of time. However, 
the only sensible statistical or logical 
prediction in any specific instance, if no 
other variables are concerned, is that he 
would act in accordance with the higher 
need. This might still give fairly good 
prediction if only 2 variables are in- 
volved, but if 20 variables are involved 
and many of them are very close in 
value or “strength,” then the amount of 
error begins to increase, In fact, it 
becomes a problem to predict even 
slightly above chance and, indeed, ex- 
cept for some limited and highly con- 
trolled experimental situations, this is 
the problem in psychology now. A 
recent illustration of this failure is re- 
ported in a carefully controlled stud 
by Little and Shneidman (1959) who 
failed to find much relationship be- 
tween interpretations of Psychological 
tests (Rorschach, MAPS, TAT, and 
MMPI) and anamnestic data, Loey- 
inger (1959) summarizing the predic- 
tiveness of individual tests in the recent 
Annual Review of Psychology States, 
“To date the only tests which meet 
standards for individual prediction are 
those of general ability” (p. 305), 
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Previous reviewers have made similar 
statements. . ; 
Another problem which arises in the 
prediction of behavior with this simple 
model is that it soon becomes apparent 
that the strength of a wish, need, or 
drive to achieve some goal such as 


being taken care of, obtaining love, or 


injuring someone is not a good pre- 
dictor of the occurrence of behavior 
directed towards the achieving of that 
goal. To some extent this problem can 
be dealt with by the notion of interac- 
tion of needs, but usually in order to 
account for the discrepancy between 
need or wish and behavior, constructs 
of the same order do not provide suffi- 
cient explanatory basis. It is actually 
necessary to postulate some other kinds 
of internal constructs to account for the 
discrepancy between what might be 
called wish, desire, need, or instinct and 
observable behavior, 

In the measurement of these need 
Strengths all varieties of tests and de- 
vices have been used. To some extent, 
the Personality questionnaire is utilized 
a little more by people adopting such 
a predictive scheme as that described 
above, but also Projective tests, ob- 
servation, interviewing, and many other 
techniques of Personality measurement 
have been used in this fashion. Test 
construction methodologies may cur- 
rently be more sophisticated in that 
they control for social desirability of 
items, Motivation, faking, lying, and 


inability to understand directions. Re- 


cent tests may also rely on purifica- 
tion of factors 


» cross-validation, or item 
analysis, However, with all these 
“modern improvements” in test desig” 
one is still left with a series of figures 
which are of doubtful utility for the 
actual prediction of behavior at a leve 
satisfactory for either practical applic®- 


: 3 t- 
tion or for the clarification of theore 
ical issues, 
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THE ADDITION OF AN EXPECTANCY 
CONSTRUCT IN THE PREDICTION OF 
Goat DIRECTED BEHAVIOR 


eget additional variables 
Reed 2 aie: the relationship of 
mich, d behavior appears to be not so 
Pict a matter of simple theoretical 
any tay as it is merely the absence of 
Bhaic explicit theory about human 
dictiy or. The development of a pre- 
ban model which recognizes the dis- 
and fae. between need and behavior 
Rout. to systematically take it into 
sophisti Tepresents a second level of 
Stication. 
Mt an earlier date perhaps psycho- 
een with this problem most 
aS re SERA in introducing concepts such 
ae os mab suppression, 
count a reaction formation, etc., to ac- 
served iy the discrepancy between ob- 
ernal ape and the presumed in- 
A QTIVE, need, or instinctual urge. 
aia level of theorizing some sys- 
ox etal is added to the internal 
be havi ional state in order to predict 
ing ioe _ Perhaps another way of say- 
Measure is that in addition to some 
Specific of preference or value of a 
Must rt another systematic concept 
Or-miss mirodueed, not only in a hit- 
into id ashion but perhaps directly 
Psychon, i gee procedure. The 
icizeq oo ytic solution has been crit- 
re a many specific concepts 
Crepane oduced to account for a dis- 
Needs y between drives, urges, Or 
these ss observable behavior, but 
able fees are not readily measur- 
- Tn addition, one does not know 
oration” explanation, i.e. reaction 
sda is the explanatory concept 
e re er such as sublimation or sım- 
Pression. 
1954 Social learning theory (Rotter, 
it is presumed that the relation- 
rea eet goal preference (rein- 
ent value) and behavior can he 
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determined only by introducing the con- 
cept of the individual’s expectancy, on 
the basis of past history, that the given 
behavior will actually lead to a satisfy- 
ing outcome rather than to punishment, 
failure, or, more generally, to negative 
reinforcement. Since the early formu- 
lations of Tolman (1932), expectancy 
theories have become more and more 
widely relied upon both in human 
learning and personality theories. It 
is possible to conceptualize more specific 
constructs such as repression and reac- 
tion formation as only special cases of 
an expectancy for severe punishment 
and that a more general relationship 
holds which includes perhaps all of 
these and also expectancies for punish- 
ment or failure of which the individual 
is quite aware. For example, an in- 
dividual may wish very much to be a 
good dancer and to dance with mem- 
bers of the opposite sex. He makes no 
attempts, however, to dance at a party 
or a dance because he can tell you “but 
I look like a fool when I go out on the 
dance floor.” We need, in other words, 
to introduce no specific construct in- 
volving the “unconscious” to explain 
the discrepancy between his wish and 
his behavior. The S may or may not 
be aware of expectancies which influ- 
ence his behavior. Whether or not he 
is aware of them may affect the degree 
to which these expectancies change 
with new experience as well as other 
variables. The degree of awareness 
may be an important additional vari- 
able; however, the level of expectancy . 
itself is the broader variable which 
bears a direct relationship to the po- 
tential occurrence of @ specified be- 
havior. 


The question arises, then, of how one 
takes into account such factors in an 
ituation. It could be 


actual testing S! s 
said that no one 1S really so naive as to 
believe that the strength of an internal 


motivational condition or need is a di- 
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rect predictor of behavior. E 
or other, whether or not the indivi us 
had learned a given behavior o oE 
pected it to work is also an important 
aspect of prediction. However, more 
often than not this aspect of prediction 
has been treated as a source of error, 
something to be eliminated if possible, 
both in testing or in the validation of a 
test instrument. As a matter of fact, 
many currently used instruments at- 
tempting to assess the strength of mo- 
tives, drives, or needs are usually con- 
founded. Although they may 
sophisticated in methodology, 
items or the test variables usua 
in part to what the individual 
overt behavior (“I frequently 
temper”), in part to what he wished 
(“I would like to have more friends”), 
and in part to what he expected to be 
the outcome of his own behavior (“I 
feel that other people do not appreciate 
my good intentions”), To some extent 
these impure items probably add to 
prediction by providing more than one 
kind of referent for behavior, but the 
nonsystematic way in which they are 
used also limits prediction, 
In trying to predict goal 
havior from tests, two pos 
open. One of these is t 
predict behavior from other behavior 
which presumably is functionally or 
predictively related to the test behavior, 
What this involves is analyzing test 
situations as behavioral samples under 
a given set of test conditions, For 
example, to assess dependent behavior 
one could use direct observation tech- 
problem 


be quite 
the test 
lly refer 
did, i.e., 
lose my 


directed þe- 
sibilities are 
© attempt to 


niques, perhaps in 
situations, in which the S is 
for help-seeking behavior (ef. Naylor, 
1955). In questionnaires the items 
should refer to what the S does, not 
to what he expects, wishes, or feels, 
The use of behavior samples for pre- 
dictions or the regarding of all kinds 
of tests including projective tests es- 


solving 
scored 
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sentially as samples of behavior to be 
analyzed in terms of what the S does 
under these conditions has been de- 
scribed elsewhere (Rotter, 1954). Like 
the work sample test in industry it 
undoubtedly provides the best pei 
tion to a limited specific behaviora 
criterion since it requires the fewest 
intermediate constructs and the ewe 
assumptions regarding the action 0 
other variables, 

However, there are many problems, 
both theoretical and clinical, when it is 
important to break down this behav- 
ioral measure into its major deter- 
minants of reinforcement value and 
expectancy for the occurrence of the 


reinforcement, For example, in psy- 

H v 
chotherapy an understanding of hov 
some 


behavior or group of behaviors 
may be most readily changed requires 
analysis into at least these two com- 
ponents. Even when strictly concerned 
with predictions of behavior in a broad 
band of life Situations, rather than 
change, analysis into separate deter- 
Minants may provide greater prediction 
than a work sample or behavioral tech- 
nique. In this second alternative it 
is important either to control or Sy 
tematically Vary the other variable o! 
measure both, ` For example, Liverant 
(1958) has Measured some needs oY 
Presenting pairs of items involving gon 
preference matched for social desirabil- 
ity, and Jessor and Mandell (Mandell. 
1959) are developing a similar test = 
measure expectancy for success in satis- 
fying the same needs, as 

In using projective material such a5 
the TAT, Crandall (1981) has demon- 
strated that expectancy for need sat!s 
faction, for which the term freedom ° 
Movement is used in social learning 
theory, can be reliably measured py 
selecting particular kinds of referent? 
The work of Mussen and Naylor 
(1954), Kagan (1956), and Le 
(1957) gives strong evidence that t™ 


Pe 
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relationship between theme counts of 
aggression on the TAT and overt ag- 
gressive behavior depends to a large 
extent on whether or not that overt 
ehavior is socially acceptable in the Ss’ 
seat ae or social climates. The 
lationship between theme count and 
ae aggressive behavior appears to 
pe only when the Ss do not have a 
meh expectancy that aggressive behav- 
a will be punished. Atkinson and 
a SERE (1956) report that in a num- 
A studies of need achievement, it 
S become clear that prediction of be- 
tikinn is enhanced if, in addition to 
R a measure of need achievement 
ia TAR achievement theme counts 
moe -like material, an additional 
Fe an, of expectancy for success 1s 
aken into account. 
bee with this type of testing 
Pitz tee ae retenty published study of 
Sites d (1958) provides a more sys- 
lable analysis. Using a highly re- 
inatio sociometric technique of nom- 
T z of fraternity brothers as his 
= Ta criteria and dealing with the 
relat; ependency, Fitzgerald found no 
es between theme counts and 
jeka behavior. Presumably, dependent 
S is not socially acceptable 
ives male college students. He had, 
ever, independent interview ratings 
es value, that is preference or 
of e for dependency satisfactions and 
Teedom of movement, Or expectancy 
Nat behavior directed toward achieving 
€pendency would be satisfied. 
die found that by using, these meas- 
atio he did obtain a significant corre- 
Birr between theme counts and the 
ot ta between need value and 
icali om of movement. More specif- 
Y, what he called a conflict score 
T Score indicating the degree to which 
esi vidual preferred dependency at 
a dependency satisfactions but ex- 
ie d that he could not achieve them 
related with theme counts for de- 
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pendency. On the other hand, an In- 
complete Sentences Blank measure of 
dependency which utilized behavioral 
referents as well as reinforcement value 
and expectancy referents did show a 
low but significant relationship of the 
number of completions dealing with de- 
pendency with both the sociometric and 
interview measures of need potential or 
actual dependent behavior. Although 
an actual analysis was not made, it 
seems very likely that the reason for 
the correlation in the case of the ISB 
and not the TAT is that at least some 
of the ISB completions were descrip- 
tions of actual behavior. Possibly a 
purer measure of behavior would have 
shown a greater relationship to the 
sociometric and interview rating assess- 
ment of actual dependent behavior in 
life situations. 

Should we build two instruments or 
at least two sets of testing operations to 
separately assess need value and free- 
dom of movement, or should we at- 
tempt to use behavioral measures in 
order to make our predictions about 
behavior, we would still be faced with 
the problem of predicting in a specific 
situation. Given measures of six be- 
havior potentials, however arrived at, 
the problem remains that of knowing 
which of these is likely to be the be- 
havior preferred in some specific situa- 
tion. One is again forced to predict 
that the behavior with the highest po- 
tential always occurs and one is limited 
again in prediction to a very low level 
of accuracy. In the laboratory situation 
where we can reduce the possible alter- 
natives to two, significant, although not 


predictive results, are possible. In the 


2 Whether or not the relationship between 
high reinforcement value 
and low expectancy is general is not yet 
known. It appears at this time to possibly 
depend on whether or not the test material 
and testing situation ts conducive to the free 
i f fantasy. 


theme count and 


expression o! 
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life situation where the ges a 
very frequently of a large or 5 
question arises of whether or ee, 
useful prediction is possible. T us leads 
us to a third level of sophistication, one 
in which the psychological situation is 
one of the variables on which prediction 
is based. 


THE PSYCHOLOGICAL Situation as A 
THIRD DETERMINANT oF GoaL 
DIRECTED BEHAVIOR 


Few would deny that the psycho- 
logical situation will affect the potential 
of occurrence of any behavior or class 
of behaviors. However, the fact that 
behavior will vary from situation to sit- 
uation is most often treated as a source 
of error, something to be avoided, If 
possible, one should construct tests or 
find personality variables which rise 
somehow above the situation. It is 
probably no exaggeration to say that 
thousands of hours of wasted work 
have been done by psychologists in the 
vain goal of finding either tests or var- 
iables which would, somehow or other, 
predict regardless of the situation in 
which the test is given or regardless of 
the situation in which the predicted be- 
havior is expected to occur, 

There are three separ. 
here which will be discussed as one 
basic problem. The first Problem is to 
understand the effect of the testing sit- 
uation on test results, For example, 
Sarason (1950) has Provided an excel- 
lent discussion of some of the influential 
situational variables in intelligence test- 
ing. The second probl 


em is to un- 
derstand the nature of the criterion 
situation 


which affects the criterion 
measures, The third and ultimately 
most important problem is to devise 
our tests with full consideration of the 
nature of the test situation in order to 
predict behavior in other situations for 
which the test was constructed. In 
other words, we need to devise tests not 


ate problems 
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to predict personality or needs or ya 
havior in the abstract but in specified 
situations or classes of situations if we 
want high prediction. We need to 
know and take into account the dimen- 
sions of situation similarity in devising 
test procedures. TF l 
Cronbach (1956) has criticized the 
failure to regard differentially the crite- 
rion situations in which tests are ap- 
plied. In regard to the test situation 
we have only attempted to standardize 
the test procedure but usually we 
ignored the importance of the socia 
context in which the test is given. Per- 
haps the most important thesis of this 
paper is that the psychological situation 
needs to be understood and systemat- 
ically considered in our predictive for- 
mula, not treated as a source of error 
or something that can be ignored be- 
cause part of the total situation is 
standardized, 
Recently there have been a number 
of studies Which demonstrate that al- 
most all tests are subject to faking, tO 
instructional variation, to examiner in- 
fluences, to testing conditions, etc., Te 
gardless of the type of test (Borislow, 
958; Green, 1951; Gross, 1959; Mus- 
sen & Scodel, 1955; Rotter, 1955): 
The general inference drawn from these 
Studies is that the tests are poor. Ac 
tually the implication of such findings 
is that we are making inefficient use 0 
our tests. If the test situation for me 
personality tests is one in which ge K 
conformity or acceptability is easily 
achieved and no other satisfactions a 
given up in achieving acceptability, 
then for Some purposes this motiv” 
should þe controlled. However, the 
test situation can also be utilized ud 
Measure the importance of social on 
ormity for the individual, What A K 
call faking is only our recognition of t ‘d 
fact that the S is taking the test aul 
different Purpose or goal than the 0) 
the examiney wants him to have. FO 
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some purposes it might be important to 
understand what kind of goals he ex- 
hibits in this kind of situation. More 
often than not we simply try to control 
what we should be studying. For ex- 
ae in giving intelligence tests it 
he ae better to study systematically 
ent ec on performance of encourage- 
4 and discouragement rather than 
ie attempt some mythical neutral atti- 
p which is presumably the same for 
teed examiner. Knowledge of the ef- 
of S of situational variations would be 
E particular value in understanding the 
meaty diverse and contradictory re- 
y = of apparently similar research in- 
p tigations. For example, Henry and 
AS (1956) found that large, pre- 
aeea differences were obtained be- 
Ween two comparable groups on the 
A onach test if one group was re- 
that ed before the regular instructions 
to st the test had been used frequently 
im Fuad psychopathology. An obvious 
oun of this study is that in- 
coll igators using this same test 1n the 
ie laboratory and in the clinical 
Stilts, may well produce diverse Te 
tion nother example of how the situa- 
by a be used in testing 18 provided 
Poss patient who is being assessed for 
he ible benefit from psychotherapy. If 
al Clinic or hospital can provide both 
ther, and female therapists and also 
ae pists who rely on support and di- 
ho as well as therapists who remain 
Dine and passive, then the testing 
Shere can be varied so that those 
aol influences are present. The 
indice could provide information to 
dicate what kind of therapy and what 
ies of therapist is likely to provide 
ion et with the most efficient con- 
entio, for relearning. For more com 
is Rah purposes of clinical testing, it 
What enone important to know under 
Dar Conditions the patient behaves in 
anoid fashion and under what con- 
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ditions he does not, than it is to know 
how many percentiles of paranoia he 
has. 

Not only can the test situation itself 
be analyzed as a behavioral sample but 
situational referents can be incorporated 
into the content of items by systematic 
sampling. For example, questionnaire 
items can deal with the conditions 
under which the S feels tense, nervous, 
happy, has headaches, etc. as Mandler 
and Sarason (1952) have done for 
some intellectual test taking situations. 
Similarly, projective methods, partic- 
ularly TAT-type tests, can systemat- 
ically vary the situation through the se- 
lection of test stimuli as has been done 
by Crandall (1951) and McClelland, 
Atkinson, Clark, and Lowell (1953). 
More recently Murstein (1959) has 
suggested a conceptual model for stim- 
ulus variation with thematic techniques. 

The many studies indicating marked 
effects of testing conditions suggest that 
it is of great importance in the publica-, 
tion of any test that descriptions of the 
differences in test results that are likely 
to be associated with different kinds of 
testing situations be provided. No test 
can be adequately understood unless the 
data regarding its standardization or 
use includes systematic descriptions of 
the differences in test results which are 
a function of different kinds of testing 
conditions and different kinds of pur- 
poses in taking the test for similar 
samples of Ss. Only when we know 
whether an S is likely to produce dif- 
ferent test results when he is taking the 
test to demonstrate how imaginative he 
is as compared to taking it to prove that 


he needs help will we be adequately able 


to understand the meaning of test re- 


sults and to predict future behavior 


from them. i i 
There have been personality theorists 


who have made much of the importance 
Kantor 


of the individual’s life space. 
(1924) was one of the first to empha- 
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size that the basic datum of psychology 
is the interaction of an individual and 
his meaningful environment. For Kan- 
tor, people do not have internal char- 
acteristics in the same sense as for other 
theorists; rather they have a reactional 
biography of interactions with the envi- 
ronment. Lewin (1951) has also em- 
phasized the importance of the life 
space or psychological situation in 
the determination of human behavior. 
Brunswik (1947) has repeatedly called 

for analyses of and sampling of psycho- 

logical situations for predictive pur- 

poses. Helson (1948) has applied his 

theory of Adaptation Level to social 

psychology stating that the effect of the 

total field can be quantified by careful 

ordering of the field of exposed stimuli, 

€cent concern with the importance and 


matic study of situation 
Variation has } 


cen expressed by Allport 
(1958) and Cronbach (1957), 


Ina more limited a 
way, Psychoanalysis h 
few areas, that certai 
rected behavior depe: 
chological situation, 
making distinctions p 


vidual’s potential resp 
figures vs, 


nd less systematic 
as suggested, in a 
n kinds of goal di- 
nded upon the psy- 

This is done in 
etween the indi- 


Beyond this, little 


systematic analysis of differences in life 


situations has heen made by the tradi- 
tional analyst. Murray's (1952) for- 
mulation of the nature of Personality 
stresses that behavior is ą function of 
the interaction of an individual with a 
Psychological situation which he felt 
could. be categorized as “press,” 


At 
a more specific level Atkinson and 
Raphelson ( 1956) have shown the 


value of including situational variables 
in studying achievement behavior. This 
general point of view has also been 
represented in sociology by Thomas 
(1951) and Coutu (1949). 

In social learning theory, it has been 
hypothesized that the situation operates 
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primarily by providing cues for BE 
S which are related to the magni 
tude of his expectancies for reinforce- 
ment for different behaviors. The ef- 
fect on the value of the reinforcement 
itself operates through expectancies for 
associated or subsequent rein forcements 
which may differ from situation to sit- 
uation. It has also been hypothesized 
that situations may be usefully catego- 
rized in terms of the predominant rein- 
forcements as culturally determined for 
any large or small culture group. There 
are many other possible ways of et 
egorizing situations depending upon pm 
predictive Purposes involved.” Met i 
ods of determining generality or pe 
mining the dimensions of amin 
among situations have been describe 
in an earlier paper (Rotter, 1955). 


, e dif- 
* Several writers have pointed out bere 
ficulty of identifying situations independe 


t i £ ascribe & 
of behavior, That is, how can one describ 
Situation as one 


independently of 
Problem is not diff 
ing stimuli 
though perh: 
or other cor 
color stimul 
sponse of thi 


would a physical wes 
the S's response? ib- 
erent from that of p $ 
along dimensions of color cial 
aps vastly complicated in ae 
mplex situations, In the aoe 
i ultimately the criteria is a nes 
€ scientist or observer, someti! nc 
a response to an intermediate instrument, a 
One that is at the level of sensory disorini i, 
tion and so leads to high observer oar: 
€ of the social situation, the Hi on 
ination is common sense base han 
nding of the culture rather rail 
the reading of an instrument. As such, 5 tly 
ability may be limited but still be sufficien In 
high to considerably increase prediction. za 
this way specific situations could be idee 
as school situations, employment sae 
girl friend situations, ete. For the DE a 
of generality various kinds of psycholog a 
constructs could be devised to arty’, a 
classes of situations which have S aes 
meaning to the S. The utility of such mee 
would have to be empirically determine: tive 
ing on the S’s response. The obj eade 
referents for these situations, which pro in- 
the basis for Prediction, however, can Bere 
dependent of the specific S. That is, 


s 1, cour 
can be reliably identified by cultural, 
mon sense terms, 


an understa 
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_ Two illustrative studies of an increas- 
a number of experimental analyses of 
rchavior which vary both internal char- 
acteristics and the psychological situa- 
tion systematically in the same study 
are described below. These studies 
~~ it basic’ paradigm that the pre- 
fly ey oe individual (personal- 
Phat situational (experimental ma- 
vicina? variables can be observed 

aneously and their interaction 
studied, g 


Intusp " 
LUSTRATIVE StupiES VARYING BOTH 
THE SITUATION AND INTERNAL 
CHARACTERISTICS 


res DA doctoral dissertation Dy 
he m ied illuminates very clearly 
boi : ential of greater prediction when 
acterists situation and the internal char- 
The b i are varied in the same study. 
Aa EPE TAA being studied by James 
ineludin a variety of learning variables, 
Pinca. acquisition, changes or shifts, 
0 ai generalization, and recovery 
ion T ized expectancies for gratifica- 
Volved Wo general hypotheses were in- 
Previoy “4 this study growing out at 
Phare us work by Lasko _ (1952). 
dme (1957), Neff (1956), and 
Schro, — Rotter (1958). Hunt and 
vhat “er (1958) have also dealt with 
The p Pears to be a related variable. 
Stateq rst of these hypotheses might be 
hat the a situational one. That is, 
differs € nature of a learning process 
th até depending upon whether or not 
at is one in which the rein- 
Come ents that occur are a direct out- 
the i. ‘Some internal characteristic of 
Charac dual such as skill, a physical 
Sity teristic, or whatever, versus @ 
Sssentian where the reinforcements are 
oF by hi: ly controlled by someone else 
be yon, Pag or by conditions or powers 
Xan d the S’s control. Perhaps a good 
Ramee of the latter would be a dice 
Wing) the winning of a door prize oF 
Soup spilled on one because a 
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waiter tripped, ete. James utilized line 
and angle matching tasks reinforcing 
each S positively on his guesses in six 
of the eight training trials. He. spe- 
cifically hypothesized when the sit- 
uation is structured in such a way that 
the S expects the occurrence of rein- 
forcements to be beyond his control 
or partly beyond his control, incre- 
ments and decrements in expectancy 
for gratification as a result of ex- 
perience are smaller, the number of 
unusual shifts, that is, shifts up after 
failure or down after success, are 
greater, extinction is faster, and there 
is less generalization from one task to 
another and greater recovery following 
extinction. 

The measurement of individual dif- 
ferences in this study followed from the 
previous work of Phares which sug- 
gested that-individuals can be differen- 
tiated in the degree to which they see 
the world and the things that happen 
to them as controlled by others or as 
determined by chance or unpredictable 
forces. The second hypothesis, then, 
was that all the differences which would 
occur as a result of the situational con- 
ditions would also be true of individ- 
uals within all groups as a function of 
their general attitude towards “control 
of reinforcement.” 

In order to predict the individual dif- 
ferences in attitude, James enlarged and 
revised the questionnaire first devised 
by Phares. This was given to all Ss 
at the end of each experiment. , The 
results are most striking. All of the 
predicted outcomes hypothesized above 
resulting from the differences im in- 
structions or situations were obtained 
were statistically significant 
ption of recovery follow- 
ing extinction, which showed a strong 
trend in the predicted direction. Sim- 
larly, within each group the individuals 
high as compared to low on the ques- 
tionnaire differed significantly in ex- 


and all 
with the exce 
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actly the same way as did the poiu 
themselves as a result of the differen 
instructions or situations presented. 
Although individual prediction was not 
the concern of this investigation, it is 
quite clear that a simple formula could 
be devised which could predict all of 
the learning variables involved in this 
study with a fair degree of accuracy, 
Certainly it is clear that a greater de- 
gree of accuracy is possible when both 
the situational and individual variables 
are taken into account. Perhaps far 
more important, this study indicates 
that various experimental paradigms in 
studying human learning are likely to 
produce different kinds of results, 
Whether or not a 


given learning 
task is one in which the S feels that 
success is dependent upon the experi- 
menter’s manipulation (for example, 
when he is expecting to predict a ran- 
dom sequence of red or green lights) 
or is the result of his own skill provides 
a crucial difference in the nature of the 
learning process itself, 


The study of James, however, does 
not provide a satisfied feeling that it 
illustrates all of the problems of predic- 
tion involving both the individual’s 
characteristics and the situation from 
which the prediction is made. 
an almost too simple picture 
interaction of these two variables, 
other recent dissertation 
(1958) suggests that this r 
can be more complex, and 
more clearly the effect of t 
situation on more common] 
of assessment procedures, 

Moss studied a general behavioral 
characteristic which he called cautious- 
ness. Essentially this was defined as 
the avoidance of risk, the selection of 
the safest alternative in a situation 
where failure or negative reinforcement 
was possible. He varied the situation 
by reacting differently to three groups 
following the administration of a ques- 


. An- 
by Moss 
elationship 
illustrates 
he testing 
y used types 
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tionnaire which he described as a meas- 
ure of social acceptability. One group 
was shown false norms at the conem 
sion of the questionnaire that indicated 
that they were in the ninetieth percen- 
tile of social acceptability for a college 
group. Another group of Ss wae 
shown that they were at the tenth per 
centile, and a third group was given no 
information about the results of ov 
Supposed test of social acceptability. I z 
hypothesized that cautious behavio 
would increase with negative reinforce- 
ment. Immediately following this prea 
cedure, the Ss were given two pe 
tive type tests and behavior on these 
tests was analyzed as to degree of catt 
tiousness. ‘al 
Prior to the giving of the ec 
acceptability” questionnaire the Ss had 
been tested on the level of aspiration 
board. Behavior on the level of asp ed 
tion board (Rotter, 1942) was catego 
tized into cautious or noncautious a 
terns.* The general tendency to see™ 
safe alternatives in the obtaining ° 
satisfactions then was measured in @ 
situation in which the § himself has 
Some control over failure or success. 
One kind of beh: 
that of sorti 
MAPS test. 
figures and as 
piles any w 
dure was re 


i : yas 
avior studied W3 


ng figures taken from 
The S was presented A 
ked to sort them into ie 
ay he wished. The pane 
peated a second time askit d 
fora different kind of sort, and a THIRA 
time. The sorts themselves were cha! 


‘ n 
acterized as being safe or cautious } 
f as 
* Cautious and Noncautious behavior pr 
characterized according to the patterns eee 
Scribed by Rotter (1954, pp. 318-324). tious 
terns 1 and 3 were considered as noncau The 
and 2, 4, 7, and 8 as cautious patterns. riety 
latter group are characterized by a vaging 
of techniques presumably aimed at nic 1 
failure to reach explicit goals. Patte! ect- 
and 3 are characterized by higher, ep or" 
ancies than performance but within | num- 
mal” bounds and consequently a higher 


ber of failures to reach one’s estimates. 


| 


 _ —— SS 
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that they dealt with highly objective 
characteristics of these figures, or less 
tt they dealt with character- 
to | which were more abstract or had 
>e read into many of the figures. 
nate sorts based on personality 
eis eristics were considered as non- 
oe as opposed to safer or more 
Prine us sorts such as those into groups 
Ne nand women, children and adults, 
Sroes and whites, etc. 
Was oe kind of behavior studied 
ne S’s response to a series of four 
T Pictures. In this case the S’s sto- 
re aes treated as Weisskopf (1950) 
tan reated them with her measure of 
ie Scendence. A cautious or safe in- 
an was one sticking close to 
Che characteristics of the picture and 
comme which the theme itself was a 
on one. 

Moss found some differences among 
9 onps in the direction he had hy- 
grou peed, that is, that the threatened 
Was P, the group that was told that it 
Brot. the tenth percentile, showed 
Wo r cautiousness than the other 
gro 8Toups. The differences between 
Ops, however, although consistently 
te direction he predicted, only ap- 
Poet significance and were not 
hiss However, when Moss, dence 
on F within groups into cautious and 
Cautious on the basis of their level 
hight tation patterns, he found some 
tions” oniieant differences. The ae 
ference showed no significant dit- 
SS among the three eap 
Signife the noncautious Ss So 
i ions ai differences between con 
0 - That is, noncautious Ss in 
fton, CNdition responded differentially 
ion POncautious Ss in another condi- 
These differences were primarily 
th tis Sreater noncautious behavior in 
in 4 information group. Differences 
None behavior between cautious and 
tant tous Ss were also highly signifi- 
On both tests within the no-infor- 


hi 
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mation but not in the other 


groups. 

In spite of the complexity of this 
study, a few findings seem relatively 
clear from an analysis of group means 
as well as significance of differences. 
Ss who were cautious on the level of 
aspiration test, which is a somewhat 
free situation, were also cautious in the 
other test conditions. Of course, this 
does not mean that they were cautious 
in situations which were not perceived 
by them as evaluation situations. On 
the other hand, Ss who were noncau- 
tious on the level of aspiration test 
appeared to maintain this greater risk 
taking behavior under test situations 
where no information about results was 
given. However, when they were neg- 
atively reinforced, they became more 
cautious and they also did not appear 
to be different from cautious Ss under 
conditions where they were quite suc- 
cessful. Perhaps this is related to the 
presumed conservatism which follows 
from success. There was no consistent 
prediction from the level of aspiration 
situation to the two “projective tests” 
which could be made without consider- 
ing the situation. In at least two of the 
situations the cautious Ss were not sig- 
nificantly different from the noncau- 
tious Ss. On the other hand, the 
psychological situation or the three dif- 
ferent situations seemed to have no 
effect on the cautious Ss. Only in the 
interaction of the noncautious Ss with 
the situational variables was prediction 
possible from the level of aspiration test. 

A similar type of result to that .of 
Moss was recently reported by Lesser 
(1959). Lesser found that intercorre- 
lations among various measures of ag- 
gression were significantly higher under 
experimental conditions of low anxiety 
than under conditions of high anxiety 
about aggression. 

James’ results suggest a rather sim- 
ple relationship between dispositional 


group 
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and situational variables, but it is Sra 
from the study of Moss and other 
studies that a simple additive or mala 
plicative relationship will not always 
describe the nature of the interaction. 
An important implication of this priti- 
ciple is that there is a general lack of 
efficiency in research studies in which 
only one set of variables, that is only 
dispositional or situational, are syste- 
matically varied, since the conclusions 
of the two sets of studies cannot be put 
together in a simple fashion. Unless 

both kinds of variables are systemat- 

ically varied within the same inve. 

tion, both an underst: 

determinants of behav 

diction of it may suffer 

A striking 


stiga- 
anding of the 
ior and the pre- 
example of the importance 
of studying the effects of dispositional 
and situational influences simultaneously 
is-provided by Helson, Blake, Mouton, 
and Olmstead (1956). In applying 
Adaptation Level theory to a study of 
attitude change they found important 
interactional effects when situations 
were varied in external influence pres- 
sure and individuals were distributed 
on a measure of ascendancy-submis- 
siveness, 

It is true that many of the above 
propositions are obvious. Most psy- 
chologists recognize that there is a dif- 
ference between overt behavior directed 
toward certain goals and the desires 
that individuals have to obtain these 
goals. Similarly, most Psychologists 
know that the psychological situation is 
a determinant of the occurrence of a 
given behavior. The thesis here, how- 
ever, is not merely that this is the case 
but that all of these variables must be 
ordered and studied systematically, in 
order to make predictions. 


SUMMARY 


The major contention of this paper 
has been that the prediction of goal di- 
rected behavior of human subjects from 
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test procedures has been and ee) 
tinue to be at an extremely low hit-or 
miss level because of inadequate con- 
ceptualization of the problem. Findings 
are frequently not replicatable becaus 
of the failure to systematically dinane 
tiate behavior, reinforcement value, and 
expectancy as internal variables and 7 
recognize that these variables are a 
fected by the psychological situation. 
The psychological situation of the pa- 
tient in the clinic is so different from 
that of the elementary psychology stu- 
dent taking a test as part of an ere 
ment that it is possible that the kinds 
of predictions which can be made in one 
situation would hardly hold in the omer 
The evidence that faking is possible anc 
that different norms obtain when sub- 
jects are job applicants, employees, O! 
volunteers does not necessarily meni 
that a test is no good. Nor is predic- 
tion essentially hopeless because it can 
be demonstrated that two experiment- 
ers, whether the same sex or opposite, 
or slight changes in the wording of ay 
structions, will differentially affect tee 
Or experimental results, All of ie 
things indicate only that the peycholog 
ical situation, Perhaps acting pumas y 
through the expectancies they arouse PY 
the cues present, considerably affect as 
havior. Tt is necessary that we do oe 
consider such influence as error to we 
ignored, as difficulty to be avoided or i 
the problem of some other profession re 
investigate. Rather it is necesary i 
Study these influences and consider 
them regularly and systematically j 
Predictive schema, That is, for son z 
purposes, factors such as social desi 
ability of items, examiner’s behavior 
and the subject’s goals in the test pee 
tion should be controlled, and in ot Y 
Cases they should be allowed to vaty: 


be 

n all cases, however, they must 

Systematically considered, ve the 
Implicit in this entire paper is 


: oa 
belief that a satisfactory theory of $' 
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directed behavior is a primary prere- 
quisite for developing adequate tests. 
a edge of statistics and test con- 
But ‘en procedures can be valuable 
kor y cannot supplant an adequate 
the y of behavior which is applied to 
test taking behavior itself. 
ia oe at a fully systematic model 
Eich ating these general or high order 
tarn ew and to coordinate them in 
AA o lower level sets of content vari- 
wit devised for different purposes, 
3 We a long and arduous but reward- 
ng task. 
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LEARNING THEORY AND THE ACQUISITION OF VALUES’ 


WINFRED F. HILL 


Northwestern University 


Bane hae by which a child ac- 
his y the values of his culture and 
ae arious overlapping subcultures is, 
awe (Duke review of the subject points 
hi ukes, 1955), still rather obscure. 
ig Seta is certainly not due to 
Bee interest in the topic. Psychol- 
a Psychiatrists, anthropologists, 
tors oo pediatricians, and educa- 
Pretic all given attention to the 
the ion of how children come to share 
ar pre races, ideals, and ethical stand- 
this eo those around them. Nor 1s 
are nterest undeserved, for few topics 
of greater practical importance. 

maps this very convergence of 
respo St from many directions 1s partly 
in eee for the difficulties involved 
tenes ns the topic. This area of 
ace has become a battleground for 
ne terminologies, with one 
Pipa. having a multiplicity of 
aDea istinct meanings, and with what 
bearin to be the same meaning often 
term ng different labels. Although many 
PE contribute to this confusion, 
Sp ore of particular interest here: 
na fication, introjection, and inter- 
ae All involve some relation 
Sign cen an individual, hereinafter de- 
eure the subject (S), and another 
Or a or personalized entity, the model 
Way > Such that S’s behavior is in some 
thes patterned after M’s. However, 
€ terms may refer either to @ state 


for this article was sup- 


he 
research 
Carnegie Cor- 


1 
Dor, 
Dora! by a grant from the 
ang 1" to the Department of Psychology 
eee School of Education at North- 
Xprect University. The author wishes to 
Robe SS his appreciation to Donald Campbell, 
Semina Winch, and the members of their 
Contry im social psychology for their many 
ibutions to his thinking on this topic. 
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of affairs or to the process which 
brought it about (Lazowick, 1955) ; 
the M may be a person, a group, or an 
idea (Glaser, 1958) ; and the relation 
may involve specific responses, broad 
meanings, or emotional reactions. 
Some of the confusion as to the 
meaning of the term identification may 
be seen in the following uses. Lazowick 
(1955) distinguishes three main uses 
of the term identification in the litera- 
ture: pseudoidentity, imitation, and 
personality change. He suggests that 
the term should be used only with 
regard to broad meanings, with imita- 
tion being the corresponding term for 
specific acts. Freud (1950) contrasts 
a boy’s identification with his father, 
which forms the basis of his ego ideal, 
and his identification with his mother 
as an abandoned object cathexis. Davis 
and Havighurst (1947) maintain that 
a child will identify with his parents 
only if he loves them, but Anna Freud 
(1946) emphasizes identification with 
the aggressor. Lynn (1959) contrasts 
sex-role identification, which is “re- 
served to refer to the actual incorpora- 
tion of the role of a given sex, and to 
the unconscious reactions characteristic 


of that role” (p. 127), with sex-role 
He 


preference and sex-role adoption. 
regards figure drawing as a measure of 
identification and choice of dolls for 


play as a measure of preference. Sears, 
Maccoby, and Levin (1957), however, 
use children’s choices of dolls as a 
measure of identification. Sanford 
(1955), discouraged by such confusions 
iders the possibility of 


of meaning, cons! the possibi 
abandoning the term identification al- 
gether, but decides to retain it to de- 


tog 
scribe a defense mechanism involving 
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č doption of M's behavior by S, 
E which is not important 
in normal personality development. 
Finally, this collection of ee 
diverse as it is, still omits those cases 
where identification is used as a syn- 
onym for loyalty or for empathy. 

There is similar confusion concern- 
ing the meaning of introjection and 
internalization. Both carry the implica- 
tion of values being incorporated into 
the personality. Hence, particularly 
with introjection, there is the sugges- 
tion of some relation with orality 

(Freud, 1950), However, Freud in 

the same discussion also uses intro- 

jection synonymously with identifica- 
tion. Parsons (1955), on the other 
hand, treats identification and inter- 
nalization as synonyms, 

The many discussions of these three 
terms in the literature seem to indicate 
that there are several processes in- 
volved but no generally accepted con- 
ventions for labeling them, A number 
of writers, including several already 
cited, h discouragement 

but the usual 


have 


é Present writer 
believes that clarity would be served by 


abolishing not only identification but 
also introjection and internalization 
from the technical vocabulary of per- 
sonality development. 

What, then, should be substituted ? 
The topics to which the above terms 
have been applied certainly deserve 
discussion, and if the redefinition of 
the old terms is unsatisfactory, the 
introduction of new terms would he 
even worse, An answer may be found, 
however, in a sort of reductionism. 
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Since the processes involved are learn- 
ing processes, the existing tocaban 
of learning is the obvious candidate 
for the job of describing them. Tt 1s 
quite possible, of course, that the Bee 
ing vocabulary of learning theory a 
be inadequate for the complexities g 
value acquisition. However, if its Be 
is carried as far as possible, the ae 
cesses of this application should clarify 
our thinking about personality qever 
ment, while any gaps which | sed 
should point to possible extensions 0 
learning theory, s A 
This approach involves treating hii 
man learning in a sociocultural e 
ronment in the same terms, at at 
for a first approximation, as amne 
learning in the environment of labora- 
tory apparatus. For this purpose, pa 
social rewards and punishments appii 
to humans may be treated as equivalen 
to the food pellets and electric shocks 
used with rats, Similarly, social ne 
are the equivalents of mazes whic 
must be learned in order to obtain the 
rewards and avoid the punishments. 
Human beings of course constitute 
far more variable environment pee 
laboratory hardware, and one on gine 
can exercise greater influence. How- 
ever, since most of the theory in van 
area is concerned with the adaptation 
of S to a relatively constant jan 
environment (whether it be called ae 
ture, social System, or the person 
ities of the Parents), this should pa 
Prove a serious stumbling block. pu 
is ample Precedent for such an E 
proach in the writings of Dollard ait 
Miller (1950), Mowrer (1950), Wh 
ing and Child (1953), and others. ats 
n addition, this approach me 
values as nothing more than inner 
from overt behavior. In principle oH 
assumption should cause no diffictt rd 
Few behavioral scientists would repe 
values (in the empirical, not the ae 
Cendental sense) as fundamentally 4 
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ys Tolne, intis (1943 ) habit strength 
iA ction (1949) equivalence beliefs. 
might t: ke i ae some theorists 
least he e issue with this view on at 

st two bases. 
ate one thing, the measurement of 
ethical (including attitudes, ideals, and 
verbal standards) is commonly by 
al methods (see the review by 
ee This leads to the sus- 
lat any measurement of values 
el means must be inade- 
get at iat only verbal measures can 
individ he significance of an act for an 
are ual. However, verbal responses 
os. of the total behavior of the 
ike at organism and may be studied 
uncon ler responses. The processes of 
Yen motivation, semiconscious 
all bition: and deliberate concealment 
Teat icate that it would be unwise to 
Values erbal and nonverbal measures of 
Cither as equivalent. Rather than treat 
he t verbal or nonverbal behavior as 
ti ot indicator of values and the 
Useful as a side issue, it seems more 
emp} _to study both and to ascertain 
a tically to what extent they lead to 
giy same generalizations about a 
en S, 
tha pamer possible objection involves 
nd e suon between | specific em 
(osso meanings, as in Lazowick’s 
tio ) contrasting definitions of imita- 
ie and identification, noted above. 
and | distinction between specific ae 
to th Droad meanings may refer ei i 
See © + ergs of mediating ine 
ralit sgood, 1953) or to the gen- 
invol-, of the stimuli and response 
low, ved, _ Neither of these ae 
lag ever, is dichotomous. Hull (1952) 
ti na ated how a mediator, the frac- 
moal antedating goal response, may 
and gon in animal behavior, and pul 
edi torms (1955) have demonstrate 
th Onal processes 1n rote learning. 
S the mediation mechanism 15 by 
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no means restricted to the “higher 
mental processes” of humans. As for 
the generality of the behavior and of 
the stimuli which guide it, this presum- 
ably represents a continuum from the 
most specific to the most inclusive 
categories. If, for example, washing 
the hands before meals is an example of 
imitation and cleanliness an example 
of identification, where would wearing 
clean clothes be classified? So, although 
the distinction between specific acts and 
broad meanings is a legitimate one, 
there are no sharp breaks on the con- 
tinuum and there is no reason to as- 
sume that basically different laws are 
involved. 

In view of the above considerations, 
an attempt to study the acquisition 
of values as a branch of learnirtg theory 
appears justified. 


Kinps oF REINFORCEMENT 


The concept of reinforcement is basic 
to learning theory. While theorists 
are by no means unanimously agreed 
on the value of reinforcement terminol- 
ogy, there is little question that an 
empirical law of effect holds, that the 
consequences of an act influence its 
subsequent occurrence. A classifica- 
tion of kinds of reinforcers will be used 
here as the basis for analyzing the 
learning of values. 


Primary Reinforcement 


For the present purpose, three kinds 
of reinforcement may be distinguished : 
“primary,” secondary, and vicarious. 
Placing “primary” in quotes indicates 
that it refers to the effects not only of 
innate physiological reinforcers but also 
of those social reinforcers which play a 
primary role in human motivation. 
Presumably the positively reinforcing 
effects of attention and praise and the 
negatively reinforcing effects of criti- 
cism, ridicule, and rejection are at least 
partly learned, but the nature of the 
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learning process is obscure, and at ne 
present level of analysis it seems pref- 
erable to treat praise for a human 
as comparable to food fora rat. The 
distinction between primary” and 
secondary reinforcement is thus one 
of convenience between that which we 
take as given and that for which we 
can find a specific learned basis, Though 
arbitrary, this distinction js perhaps 
no more so than the decision as to 
whether food in the mouth should be 
considered a primary or a secondary 
reinforcer. 
One particular kind of le 
“primary” reinforcement is t 


tion of a generalized tendenc 
others. 


arning by 
he acquisi- 
y to imitate 
Miller and Dollard (1941) 
have indicated how a generalized tend- 
ency to'imitate the behavior of others 
may be learned in the same way as 
any other class of responses. Although 
their demonstrations of imitation in- 
volved S’s patterning his behavior after 
a leader who was present, Church 
(1957) has shown that rats can also 
learn to Tespond 
Same cues to whi 


at humans 
n their behavior after 
that of other people, not only when the 
M is present, but also in M’s absence 
by utilizing the appropriate environ- 
mental cues. As the child is repeatedly 
rewarded for imitative behavior in a 
variety of otherwise different situa- 
tions, and as his Capacity for abstrac- 
tion increases, it seems plausible that a 
generalized imitative tendency would 
develop. It would be desirable, how- 
ever, if the widespread anecdotal sup- 
port for this deduction could he bol- 
stered by experimental data. t 
The same process presumably applies 
to verbal instructions. The child is 
typically reinforced (though with some 
striking exceptions) for doing what 


can learn to patter; 
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others tell him to do. Hence (common 
parental impressions to the contrary 
notwithstanding), a generalized tend- 
ency toward conformity to verbal in- 
structions may be expected to develop. 
With increasing intellectual develop- 
ment, this tendency should come to in- 
clude conformity to fictional examples 
or to abstract ethical exhortations. 


Secondary Reinforcement 


Although no basic distinction is made 
here between primary and secondary 
reinforcement, there is one case fre- 
quently discussed in the literature 
where the acquisition of reinforcing 
Properties by certain stimuli may be 
analyzed in detail. These stimuli are 
those which are connected with care 
of the child by adults, i.e., the non- 
essential aspects of nurturance. These 
include patterns of speech, facial ex- 
pressions, gestures, and the like. Since 
these occur with those nurturant be- 
haviors which are primary reinforcers, 
such as feeding and cuddling, they may 
become Secondary reinforcers. By 
stimulus Seneralization, these behaviors 
should also be rewarding to the child 
(although less So) when produced by 
himself. As the child grows older and 
the parents expect him to take greater 
care of his own needs, he is more aní 
more forced to Provide not only his 
own primary reinforcers, but his ow? 
secondary reinforcers as well. Bena 

« may be expected to show some 0 
the same mannerisms as his parents 
showed when caring for him. 

This kind of learning appears to be 
ene of those processes which Freu 
(1950) includes under the heading O 
identification or introjection, that m 
which abandoned object cathexes be 
come incorporated into the ego. Hoe 
ever, in the view presented here, bis 
Coincidence of abandonment and i 
corporation into the ego refers we 
to Performance, not to learning. gue 
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Secondary reinforcing value of the 
parental mannerisms is built up during 
the period of nurturance, but becomes 
Evident in the child’s behavior as nur- 
aitis begins to be withdrawn. This 
aE process has been discussed by 
Mowrer (1950, Ch. 24) in connection 
With the learning of language, and by 
Lindemann (1944) as a reaction to 
the death of a loved one. Although 
ay process appears better adapted to 
je leming of rather trivial manner- 
BE vie is capable at least in principle 
i veing adapted to more general and 
Significant values as well. 


Kicn, ig 
‘carious Reinforcement 


tious reinforcement does not 
fhe, ne same dignified status in learn- 
reed as do primary and secondary 
ap ee but some such process 
oe necessary 1n order to explain 
cari Tnportant human learning. Vi- 
ae reinforcement involves the gen- 
Bie on of reinforcing effects from 
ine Ts to oneself, hence learning from 
1€ reinforcers which others receive. 
ie act is reinforced for S as a result 
ce act being performed by M, fol- 
ec by reinforcement to M. For 
aaa if S observes M trying to 
ie. problem by certain techniques 
ing ‘ae nage 3 S is more likely to 
A the same techniques when faced by 
Similar problem than if M had failed 
i Solve the problem. Although most 
Ree evidence for such learning a 
e dotal, Lewis and Duncan (195 3) 
T provided some evidence of it n 
Poman gambling situation, and Darby 
it; Riopelle (1959) have demonstrated 
k in discrimination learning by mon- 
Sys; 
Although vicarious reinforcement in- 
Solves selective imitation, it differs 
Serit, the selectivity of imitation, de- 
in ood by Miller and Dollard (1941), 
cee S imitates some Ms and not 
ers because of differential reinforce- 


ment received by S for imitating the 
two M/s. Vicarious reinforcement does 
not inyolve any reinforcers delivered 
directly to S; the discrimination of Ms 
to be imitated or not is made entirely 
on the basis of S’s observation of /’s 
experience. This distinction is empha- 
sized by Campbell (in press). In 
Hullian terms, vicarious reinforcement 
involves the acquisition of K by ob- 
servation, ` à 

This type of learning need not be 
restricted to the effect of particular 
reinforcers administered to M under 
specific conditions. Stimulus general- 
ization should occur not only from M’s 
behavior to S’s but also from one act 
of M’s to another. As a result, if M is 
frequently reinforced, S should find it 
rewarding to resemble M in general, in- 
cluding imitation of some of M’s behav- 
iors which S has never seen rewarded. 
Thus a beginning salesman (S) might 
treat a customer with extreme polite- 
ness because he had observed another 
salesman (M) making a large sale 
while using such behavior, and he 
might also smoke a cigar because he 
had observed his highly successful 
salesmanager (M) doing so. In the 
former case M’s behavior (politeness) 
and M’s reinforcement (a large sale) 
were paired, whereas in the latter case 
the reinforcement (business success ) 
was a perennial experience of M, but 
not paired with the particular behavior 
(smoking cigars) in question. Both, 
however, are examples of vicarious 
reinforcement, 

Vicarious reinforcement corresponds 
to identification as defined by Masser- 
man (1946) and to that aspect of 
identification referred to by Kagan 
(1958) as “the motivation to command 
e desired goal states ofa 
model” (p- 298)- Freud’s (1946) 
identification with the aggressor also 
fits under this heading if successful 
aggression is assumed to be reinforcing 


or experienc 
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to the aggressor. Sanford’s (1955) 
concept of identification also involves 
the adoption as M of someone per- 
ceived by S as successful. 


Conflicting Sources of Reinforcement 


Traditionally the terms identifica- 
tion, introjection, and internalization 
might be applied to any or all of the 
learning processes described above or 
to their end product, similarity þe- 
tween S and some M. Since for the 
most part these processes involve learn- 
ing by imitation, Tequire some kind of 
reinforcement, and result in similarity 
between § and M, it may be asked 
why detailed analysis of the rather 
subtle differences among them is called 
for, 

The answer is apparent when the 
is considered, 
S may be such 


learning processes, 
conflict is between the tendency ț 
imitate Ms whom $ is directly rein- 
forced for imitating (e.g., well behaved 
children) and the tendency to imitate 
Ms whom S perceives as Successful 
(eg., tough kids). Such Conflict js 
inevitable to a certain extent in chil- 
dren, since they are not permitted to 
imitate their (presumably more suc- 
cessful) elders in all respects, but it is 
particularly prominent in members of 
low-status social categories, (e.g., Ne- 


groes) who are often conspicuously 
not reinforced for imitating high-status 
Ms. In the broadest sense, any situa- 
tion where there is discrepancy among 
what S is told to do, what he is re- 
warded for doing, and what he sees 
others doing is a potential conflict situa- 
tion, and one in which the use of any 
single inclusive term such as “identifi- 
cation” obscures the relevant variables. 

The occurrence of conflict among the 
various reinforcement processes makes 
possible a finer analysis of the acquisi- 
tion of values than could be made 
otherwise. If there is perfect agree- 
ment among what S is told to do, what 
those who nurture him do, what those 
around him conspicuously master the 
environment by doing, and what he 
himself is directly Tewarded for doing, 
there is little basis for judging how 
much each of these factors contributed 
to S’s adoption of the values of those 
around him, By observing situations 
Conflict, greater knowl- 
edge of the efficacy of each kind of 
may be obtained. Re- 
ict situations might an- 
Swer such theoretically and practically 
Important questions as: “Does domi- 
Nance or nurturance on M’s part do 
more to make M effective in modifying 
S’s values ?”; “Do words and examples 
completely lose their efficacy if the ap- 
propriate behaviors, when elicited, are 
not reinforced ?” + and “To what extent 
is behavior influenced by Ms presented 
verbally (e.g., in literature) ?” : 

There is of course no special merit 
to the classificatory scheme presented 
here, “xcept for the concept of vi- 
carious reinforcement, the writer has 
avoided attaching distinctive labels to 
the learning processes described. The 
purpose of this discussion was to show 
how the terminology of learning 
theory can be applied to processes of 
value acquisition which have been de- 
Scribed by Personality theorists. This 


, 
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not only serves as a step toward the 
mtegration of these two areas of study, 
but also suggests the probable useful- 
ness of employing such independent 
variables as number, percentage, mag- 
Mtude and delay of reinforcement, 
distribution of practice, and dis- 
Criminability of stimuli in the study of 
value acquisition, As both learning 
theory and personality theory develop 
further, it is to be expected that any 
Schema developed now will be at least 
Partially replaced by newer concepts. 
Rather than developing in further de- 
tail the ideas suggested above, the 
remainder of this discussion will there- 
fore concentrate on the application of 
this kind of thinking to a narrower 
area, the development of conscience. 


CONSCIENCE 

Tiat F a 

i Negative values, or conscience, have 
ceived much more attention than 


D iti ~ pa. 
i Ositive values. Educators seeking 
© improve children’s characters, 


Psychoanalysts concerned with the 
Yranny of the superego, anthropolo- 
te trying to distinguish between 
pate and guilt cultures, and experi- 
Sa psychologists noting the per- 

ence of avoidance responses have 
shared this emphasis on values of the 
hou shalt not” variety. Because 
hile widespread interest in con- 

ce, it is a particularly appropriate 
to illustrate the 


of 


topic with which 

Possibilities of the learning theory 
pP toach to the study of values. Sears, 

al accoby, and Levin (1957), in their 
‘allenging book Patterns of Child 


ao devote a chapter to the a 
cl Pment of conscience 1m preschoo 

dren, Their treatment of the topic 
a point for the 


br Serve as a starting 


“sent analysis. 
Gan 
teria for Conscience 


inoars, Maccoby, and Levin 
ee criteria for recognizing the 


give 


operation of conscience in young chil- 
dren: resistance to temptation, self- 
instructions to obey the rules, and 
evidence of guili when transgression 
occurs. These three criteria are 
treated jointly as defining conscience, 
and no attempt is made to analyze 
their separate developments. Although 
the authors mention that the aspects 
of conscience do not necessarily all 
appear at once, they regard conscience 
as representing an internalization of 
control which is fundamentally differ- 
ent from external control, whether by 
force, fear of punishment, or hope of 
material reward. This treatment of 
conscience as essentially a single vari- 
able seems premature in our present 
state of knowledge; certainly the 
learning theory approach to person- 
ality advocated here would involve 
rate analyses of these diverse re- 


seps 
sponse patterns. 

The first criterion, resistance to 
temptation, may be viewed simply as 
avoidance learning. Although this 
kind of learning is still a focus of 
theoretical controversy, much experi- 
mental data are available concerning it 
(Solomon & Brush, 1956). Sidman’s 
(1953) studies of avoidance behavior 
without a warning signal and Dins- 
moor's (1954) analysis of punishment 
show how feedback from an indi- 
viduals own acts can become a cue 
for avoidance, and how persistent 
such avoidance may be. Although 
children can presumably learn to re- 
spond to more abstract characteristics 
of cues than can animals, there is no 
a child's learning to 
funda- 


reason to regard 
certain behaviors as 
different from a rat’s learn- 
ing to do so. The fact that the child 
avoids the forbidden acts even in the 
absence of the parents is presumably 
due to the parents’ having in the past 
discovered and punished (in the 


avoid 
mentally 
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broadest sense of that word) trans- 
gressions committed in their absence. 

This relating of conscience to avoid- 

ance learning suggests that inde- 
pendent variables known to be effec- 
tive in animal avoidance learning 
would be among the most appropriate 
ones for study in connection with the 
development of conscience in children. 
Within certain limits, the greater the 
intensity of the punishments (Brush, 
1957; Miller, 1951) and the shorter 
the delay between transgression and 
punishment, (Mowrer & Ullman, 
1945; Solomon & Brush, 1956) the 
greater should be the resulting inhibi- 
tion. Though the data are somewhat 
ambiguous, greater certainty of pun- 
ishment might be expected to produce 
inhibition which would be more com- 
plete in the short run but also less per- 
sistent once punishment was perma- 
nently withdrawn (Grant, Schipper, & 
Ross, 1952; Jenkins & Stanley, 1950; 
Reynolds, 1958). This prediction 
suggests that even this one criterion 
of conscience may not be unitary, that 
different laws may apply depending on 
whether one asks how completely the 
child obeys the prohibitions or how 
long he continues to obey them after 
leaving the parental home. Tf partial 
reinforcement should turn out to be a 
crucial variable in the human situa- 
tion, these two criteria might even þe 
inversely related. The prediction 
suggests that the question, “Ts incon- 
sistent discipline bad?” js far too 
simple; one must at least ask, “Bad 
for what?” 

It must also be kept in mind that 
punishment is not restricted to physi- 
cal chastisement or even to noxious 
stimuli in general, including scolding 
and ridicule. Withdrawal of positive 
reinforcers may be very effective as a 
punishment, a fact which complicates 
the analysis. As this is a much dis- 


also 


cussed topic in personality theory, it 
will be considered below. 

Sears, Maccoby, and Levin’s second 
criterion of conscience, self-instruction, 
obviously makes the human case 
different from the animal case, but it 
does not introduce any new motiva- 
tional principle. One of the advan- 
tages of membership in the human 
species is the possibility of using 
verbal symbolization in dealing with 
one’s problems. It is natural that a 
person learning an avoidance, like a 
person learning any other difficult 
response pattern, should give. himself 
verbal instructions, especially since 
verbal coaching by others is so im- 
portant in the learning of social 
prohibitions, Moreover, such self- 
instruction is an imitative act which 
might be learned according to any of 
the reinforcement paradigms dis- 
cussed above, Presumably the learn- 
ing of prohibitions proceeds differently 
in verbal and nonverbal organisms, 
but observations of the relation be- 
tween moral statements and moral 
behavior (Hurlock, 1956, pp. 406, 
411-412) argue against the assump- 
tion that there is a high correlation 
between verbal and other criteria of 
conscience, except as both are influ- 
enced by the values represented in the 
social environment, 


The third criterion of conscience, 
guilt at violations of the prohibitions, 
is itself complex, with many verbal, 
autonomic, and gross behavioral as- 
pects. However, the striking paradox 
about guilt, which has seemed to some 
students to set it apart from the ordi- 
nary laws of learning, is that it often 
involves the seeking of punishment. 
The person who has transgressed, 
rather than trying to avoid punish- 
ment, or even waiting passively for it 
to come, actively seeks out the authori- 
ties, confesses, and receives his pun- 
ishment with apparent relief. He may 
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also, or instead, go to great lengths 
to make restitution. Were it not for 
these phenomena of punishment- 
seeking and self-sacrificing restitution, 
it would be easy to dismiss guilt as 
merely the kind of fear associated 
with anticipation of certain sorts of 
punishment. As it is, the existence 
of guilt serves as an argument for 
regarding conscience as something 
more than the sum of all those avoid- 
ances which have moral significance 
in one’s culture. 

However, the attempt to distinguish 
between guilt-controlled and other 
behaviors has not been very successful. 
Though the distinction between guilt 
cultures and shame cultures has had 
a considerable vogue in anthropology 
(eg., Benedict, 1946; Havighurst & 
Neugarten, 1955; Mead, 1950), the 
inadequacies of the distinction have 
been pointed out by Ausubel (1955) 
and by Singer (1953). Moreover, the 
relation between conformity to a 
standard and guilt when the standard 
has been violated is open to question. 
Shaw (1948) suggests that confession 
may even be so satisfying to some 
people that it constitutes a reinforce- 
ment for sinning. So, although the 
phenomena of guilt may raise difficul- 
ties for learning theory, these diffi- 
culties probably cannot be solved by 
using guilt to define a distinctive kind 
of learning. 

The above considerations should 
suffice to indicate that conscience can- 
not be assumed a priori to be unitary. 
The extent to which short-run con- 
formity, long-run conformity, self- 
instructions to conform, certain kinds 
of distress at having failed to conform, 
and voluntary confession of non- 
Conformity are intercorrelated is a 
matter to be empirically determined. 
Moreover, even if high positive inter- 
correlations are found, it is possible 
that they may reflect correlations in 


the environment rather than any 
fundamental unity of process. If en- 
vironmental pressures toward con- 
formity vary markedly, artificially 
high correlations among the criteria 
of conscience are to be expected. 
However, even when this artifact is 
removed, an analysis of separate 
learning processes for different be- 
haviors may still lead to the prediction 
of high correlations among the be- 
haviors. Such an analysis is presented 
below. 


Learning of .Conscience 


Sears, Maccoby, and Levin found 
that the development of conscience, as 
defined jointly by their three criteria, 
was greater in those children. whose 
parents used love-oriented forms of 
discipline (praise, isolation, and with- 
drawal of love) than in those whose 
parents used “materialistic” forms of 
discipline (material rewards, depriva- 
tion of privileges, and physical pun- 
ishment). A similar finding, though 
not highly reliable statistically, is 
reported by Whiting and Child (1953, 
Ch. 11) in a cross-cultural study of 
guilt as measured by attitudes toward 
illness. This is consistent with the 
widely held view that the acquisition 
of parental values occurs most fully 
in an atmosphere of love (eg: 
Ausubel, 1955; Davis & Havighurst, 
1947). It is possible, however, that 
this finding may be due, not to love- 
oriented discipline as such, but to other 
characteristics of discipline which .are 
correlated with it, The effect of this 
kind of discipline may be to accentuate 
the learning of several different re- 
sponses, all of which contribute to the 
overall diagnosis of high conscience. 

The various kinds of punishments 
commonly applied to children proba- 
bly differ markedly in the temporal 
relations and the reinforcement con- 
tingencies involved. Physical punish- 
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ment is likely to occur all at once and 
be over quickly, while punishment by 
deprivation of objects or a s 
likely to be either for a fixed period o 
time or for as long as the disciplinarian 
finds convenient. Discipline by with- 
drawal of love, on the other hand, 
probably much more often lasts until 
the child makes some symbolic re- 
nunciation of his wrongdoing, as by 
apologizing, making restitution, or 
promising not to do it again. The 
child is deprived of his parents’ love 
(or, the parents would claim, of the 
outward manifestations of it!) for as 
much or as little time as is necessary 
to get him to make such a symbolic 
renunciation. When he has made it, 
he is restored to his parents’ favor. If 
the normal relation between the 
parents and child is one of warmth, 
such discipline strongly motivates the 
child to make the renunciation quickly. 
On repeated occasions of transgres- 
sion, punishment by withdrawal of 
love, and symbolic renunciation, the 
child may be expected not only to 
learn the renunciation response as 
escape from parental 
eventually to use it 
rather than merely an escape response, 
Thus if the wrongdoing is not 
mediately discovered, the child may 
anticipate his parents’ impending dis- 
favor by confessing in advance and 
making the symbolic renunciation, 
The result of this hypothesized 
sequence of events is that the child 
makes a verbal response which is in 
effect an instruction to himself not to 
repeat his wrongdoing. The next 
time temptation comes, he is more 
likely to make this verbal response he- 
fore transgressing, Although this 
does not guarantee that he will not 
transgress, it is likely to reduce the 
probability. If he succumbs to temp- 
tation, he is more likely to confess 
before being caught and thereby 


an 
disfavor but 


as an avoidance 


im- 


avoid the temporary loss of his parents 
love. Thus if the above reasoning iS 
correct, all three criteria of conscience 
should be present to a greater degree 
in the child who has been disciplined 
in this fashion than in other children. 
According to the present hypothesis, 
however, this will be due to the fact 
that punishment continues until the 
child makes a symbolic renunciation, 
rather than to the fact that the pun- 
ishment involves withdrawal of love. 
If physical chastisement or loss of 
privileges are used in the same way, 
the same outcome is predicted. 

A possible weakness of this hypoth- 
esis is that children might learn a 
discrimination between the symbolic 
and the actual avoidances, so that 
they would develop a pattern of vio- 
lating parental standards, immediately 
confessing and apologizing, and then 
transgressing again at the next hint 
of temptation. Tf forgiveness is of 
fered freely and uncritically enough. 
such a pattern presumably does de- 


velop. In this case the correlation 
among the criteria of conseu 
would be expected to drop, actua 


avoidance of w 
being associated 
(For this reasor 
Levin might ha 


rongdoing no longer 
with the other criteria. 
n, Sears, Maccoby, an¢ 
ve found smaller rela- 
tions if they had studied older chil- 
dren.) However, if the parents 
discrimination keeps up with the 
child’s so that the child cannot count 
on removing all the parents’ disfavor 
with a perfunctory apology, the ay 
cacy of this kind of discipline should 
be at least partially maintained. 

If this explanation of greater con- 
Science in children disciplined by 
withdrawal of love is correct, why ee 
Sreater conscience also found with ue 
other kinds of love-oriented control! 
Since these were all found to be inter- 
correlated, and since their relations tO 


œ uni- 
the degree of conscience were 1 


TS 


e a 


LEARNING THEORY AND THE ACQUISITION OF VALUES 327 


formly low, interpretations either of 
separate techniques or of love orienta- 
tion as a general trait are necessarily 
somewhat dubious. As an example 
of the difficulties involved, it may be 
noted that reasoning with the child is 
counted as a love-oriented technique 
Solely on the grounds of its correlation 
with the other such techniques. 
Nevertheless, it shows a higher rela- 
tion to conscience than do two of the 
three clearly love-oriented techniques. 
In view of such complexities, it seems 
legitimate to suggest that the crucial 
factor in those techniques associated 
with conscience may not be love 
orientation as such, but something 
else correlated with it. 

To test this hypothesis, it would be 
necessary to have further detailed in- 
formation of the sort that Sears, 
Maceoby, and Levin used, so that 
disciplinary methods could be classi- 
fied according to the time relations 
discussed above. It is predicted that 
the parents’ tendency to make termina- 
tion of punishment contingent on 
Symbolic renunciation would be cor- 
related with love-oriented discipline. 
However, if each were varied with 
the other held constant, conscience 
Should be more closely related to 
response contingency than to love 
Orientation, 

Along with this overall analysis of 
Conscience, more detailed. analyses 
Could be made of the various com- 
Ponents of conscience. According to 
N€ present view, intercorrelations 
among these criteria would be mod- 
erate for the entire sample and low 
When method of discipline was held 
Constant. 

The learning sequence s 
above js only one of several possible 
€xplanations of the Sears, Maccoby, 
and Levin finding. By suggesting that 
the crucial causal factor is not the dis- 
nction between materialistic orienta- 


discussed 


ton and love orientation, but another 
distinction correlated with it, the 
present hypothesis gains an advantage 
in objectivity and in practical applica- 


bility. Whether it also has the 
advantage of correctness must be 
empirically determined. The chief 


purpose of the present example is to 
point to the availability of such reduc- 
tionist hypotheses in the study of 
values and to argue that they deserve 
priority in the schedule of scientific 
investigation. 


Permanence of Conscience 


It would be particularly desirable to 
have a follow-up study to compare evi- 
dences of conscience in kindergarten 
with those of the same people later in 
life, when they were no longer pri- 
marily under the direct influence of 
their parents. Such a follow-up would 
help to clarify the relation between 
short-run and long-run conformity 
discussed above. Is the child who 
thoroughly obeys all his parents’ pro- 
hibitions also the one who sticks to 
these standards when his parents are 
no longer around and his new asso- 
ciates have different standards? An- 
ecdotal evidence can be cited on both 
sides, though the bulk is probably in 
the affirmative. To the extent that 
current and later conformity are inde- 
pendent, what variables influence one 
more than the other? 

Predictions from learning theory on 
this topic are by no means unam- 
biguous. Nevertheless, two lines of 
reasoning may be suggested concern- 
ing the type of discipline likely to re- 
sult in the most persistent avoidances, 
(Persistence here refers, not to abso- 
lute level of avoidance, but to relative 
lack of decrement in the strength of 
avoidance with time.) 

The first line of reasoning is from 
the differences in the slopes of 
generalisation gradients for different 
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kinds of learning (Dollard & Miller, 
1950). In most cases the poni asi in 
slope is between approach (or excita- 
tory) and avoidance (or inhibitory) 
tendencies. It appears, however, that 
the basic distinction is between re- 
sponse tendencies activated by innate 
and by learned (generally fear) drives 
(Miller & Murray, 1952). When 
stimulus conditions change, the re- 
sultant removal of cues for fear pro- 
duces a greater weakening of response 
tendencies based on fear than of re- 
sponse tendencies based on other 
drives. Hence, the generalization 
gradient of responses and inhibitions 
based on fear is steeper than that of 
other responses and inhibitions. This 
implies that discipline based on fear 
should lose its efficacy more quickly 
than discipline based on rewards as 
distance from the disciplinarian or any 
other change in conditions increases. 
Since this difference in slope is found 
on continua both of distance (Miller, 
1944) and of similarity (Miller & 
Kraeling, 1952; Murray & Miller, 
1952), it seems reasonable to predict 
that it also applies to that complex 
continuum along which an individual 
makes the transition away from pa- 
rental apron strings. Tt would follow 
from this analysis that of two inhibi- 
tions learned in childhood, equal in 
age and original strength, one learned 
from the threat of losing rewards 
would be more effective later in life 
than one learned from the 
provoking threat of punishment, 
In this analysis, the advantages of 
discipline by manipulation (includin 
withdrawal) of reward would apply 
to any kind of reward, material or 
social, not merely to parental love. 
However, the desire to continue re- 
ceiving love from the parents may 
persist after the child has outgrown 
the need for other parental rewards, 
such as gifts and privileges. Dis- 


fear- 
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cipline by withdrawal of love, in an 
atmosphere of warmth, might there- 
fore be even more effective than other 
forms of discipline by denial of reward 
in producing persistent avoidances. 
The other line of reasoning, involv- 
ing the partial reinforcement effect, 
argues for the persistence of cil 
science learned by the process outlinec 
above, in which a symbolic renuncia- 
tion of wrongdoing terminates punish- 
ment. Although the greater resistance 
to extinction of responses which have 
received less than 100% reinforce- 
ment has been demonstrated primarily 
with positive reinforcement, it applies 
to negative as well (Humphreys, 
1939, 1940; Grant, Schipper, & Ro 
1952). Partial reinforcement is o 
course present with all kinds of dis- 
cipline, since punishment depends i 
the parents’ moods and on the socia 
situation, as well as on the child’s 
being caught. However, the above 
analysis of the kind of punishment 
which terminates when the cap 
makes a symbolic renunciation © 
wrongdoing suggests that such dis: 
cipline may involve an additiona 
Source of partial reinforcement. AS 
was indicated above, the child ae 
learn that he can avoid punishmen 
by confessing and apologizing. Wiss 
this happens, the avoidance starts tO 
extinguish. However, the discerning 
parent then learns not to accept 
apology, and the child is punishe 
anyway. The child must then make 
a more vigorous and convincing ee 
bolic renunciation than before in order 
to terminate the punishment. In Ea, 
dition, the discrimination he has mae 
between the symbolic renunciation an© 
the actual avoidance is broken do Wi 
Punishment can only be prevented bY 
actual avoidance of wrongdoing. & 
Owever, after a period of chet ie 
e once more transgresses and t þe 
confesses, he is likely again not to 
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punished. This starts the cycle of 
extinction and reconditioning of the 
avoidance response going again, thus 
continuing to provide a reinforcement 
schedule in which only part of the 
child's transgressions are punished. 
To predict that such partial rein- 
forcement will retard extinction is ad- 
mittedly problematic, both because of 
the complexity of the avoidance 
Paradigm and because the unpunished 
transgressions are assumed to occur in 
blocks rather than randomly. Never- 
theless, the hypothesis deserves con- 
sideration, not only as a prediction 
from learning principles to personality, 
but also as a case where the needs of 
personality theory might guide re- 
Search in learning. 
_ Although these two lines of reason- 
mg agree in predicting maximally 
Persistent conformity to parental pro- 
hibitions by children reared in an 
atmosphere of parental warmth and 
disciplined by withdrawal of love, they 
differ in their other predictions. To 
test these various hypotheses sepa- 
rately would require both short-run 
and long-run analyses of the effects of 
a variety of parental discipline pat- 
terns, ` The following hypotheses 
might be tested: (a) that discipline 
>Y deprivations (whether of things, 
Privileges, or love) has more per- 
Sistent effects than discipline by 
Noxious stimulation (whether physical 
Or social) ; (b) that where the child is 
taught to confess and apologize for his 
transgressions, avoidance behavior 
will go through cycles of extinction 
“Nd reconditioning; and (c) that pun- 
‘Shing only part of a child's trans- 
Tessions results in more persistent 
Obedience than does punishing all of 
em, 


Summary AND CONCLUSIONS 


A suggested that the terms 
“lentification, introjection, and inter- 
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nalization be replaced by detailed 
analyses in learning-theory terms of 
the acquisition of values. A reinforce- 
ment framework for such analyses is 
outlined, and examples are presented 
dealing with the concept of conscience 
and the factors influencing its develop- 
ment. It is argued that this would 
simplify terminology, encourage more 
precise study, and further the integra- 
tion of learning and personality 
theories. 

This analysis, like all attempts to 
integrate the harder-headed and the 
softer-hearted portions of behavioral 
science, is open to attack from both 
sides. On the one hand it may be 
objected that the present treatment is 
too cavalier with the interpersonal and 
intrapsychic complexities of person- 
ality development, that the internaliza- 
tion of values and the identification of 
one person with another cannot be 
treated as though they were nothing 
but the simple learning of a rat in a 
maze. The answer to this objection 
is that no “nothing but-ism” is in- 
tended; it is an empirical matter both 
to determine how far the principles of 
learning (not necessarily simple) can 
go in explaining personality develop- 
ment and to decide how much the ad- 
ditional principles suggested by some 
writers actually contribute to our 
understanding of the phenomena in 
question. The attempt to catch too 
much complexity at a single stroke 
may retard rather than advance our 
understanding. 

On the other hand it may be 
objected that the interpretations given 
here are untestable, that the variables 
involve such diverse and subtle be- 
haviors over such long periods of time 
as to defy adequate measurement. 
Admittedly the questionnaire, inter- 
view, and brief-observation techniques 
used in this area leave much to be 
desired. However, as long as applied 
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behavioral scientists are called upon 
to deal with questions of personality 
development, poor data to guide their 
decisions are better than none. Study 
of learning of values by humans, 
guided by the principles of learning 
based on both animal and human 
studies, has the potential to make vital 
contributions to many theoretical and 
applied areas of knowledge. It is 
hoped that the present discussion may 
contribute something to that goal. 
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A number of studies haye shown a 
relation to exist between word fre- 
quency, word value, and visual duration 
thresholds. Certain issues have devel- 
oped concerning the interpretation of 
the observed relations. Two types of 
interpretation of results can be dis- 
tinguished : 

1. Those interpretations that cl 
word frequency to be the major d 
minant of visual 


This point of view 
responses made to 
presentation of words 
same manner tl 
learned. Differ 
tion thresholds 


aim 
eter- 
duration threshold. 


7 


S as 
y and perceptual de- 
on of the law of 
Thus, the problems that 
exist with regard to the differential 
visual duration thresholds of words are 
problems in learning, not problems in 
perception. 

2. Those interpretations t 
to show that diff 
duration threshol 


perceptual selectivit 
fense is a violati 
parsimony, 


hat attempt 
erences in the visual 
ds of words are due, 
all or in part, to differences jn the affec- 
tive qualities (values, goodness—had- 
ness, affective tone, emotional valence) 
attached to words. These affective 
qualities affect visual duration thresh- 
olds directly, through perceptual selec- 
tivity and/or perceptual defense. Vis- 
ual duration thresholds of words are 
determined by variables that can be 
called perceptual variables instead of or 
along with those variables that usually 
bear upon learning, 


We wish to center our introductory 
discussion around those few studies 
which, for us, best exemplify the two 
Positions described above. Solomon 
and Howes (1951) take the position 
that differences between words can be 
accounted for on the basis of word 
frequency. The perceptual process does 
not differ in any fundamental way from 
the learning Process. They say: 


Suggest that 
tachistoscopi 
effect on wor 


According to Solomon and Howes 
(1951, p, 258) visual duration thresh- 
old can be accurately predicted from a 
knowledge of Population-wide word fre- 
given in the Thorndike- 
(1944). There is no need 


tors op- 

yneratic 

word frequency. These 

frequency account for dif- 

e visual duration thresh- 
olds of words such as those found be- 
tween the visual duration threshvld of 
a value-oriented word for subjects (Ss) 
who score high as opposed to low on 
Specific scales of the Allport-Vernon 
Study of Values, They do not have 
any other influence on visual duration 
threshold, 
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differences in 
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The second position concerning the 
relation between word frequency, word 
value and visual duration threshold 
is clearly expressed by Postman and 
Schneider (1951). Faced with the 
problem of salvaging value as a variable 
influencing visual duration threshold, 
they say that 


P It may... be more profitable theoret- 
ically to regard both frequency of word 
usage and duration thresholds as dependent 
variables, both manifestations of more funda- 
mental psychological properties attributed to 
the organism, such as “habits,” hypotheses, 
or even, perhaps, “personal values” (p. 277). 


While not denying that word fre- 
quency can account for a considerable 
proportion of the variance in visual 
duration thresholds of words, they 
claim that values (called directive fac- 
tors in this paper) directly influence the 
visual duration thresholds of infrequent 
words, 

Postman, in a later article (1953), 
Suggests that the frequency of usage of 
a specific word is determined by its 
reinforcing qualities. He suggests that 
frequent as opposed to infrequent words 
differ systematically in affective tone. 
He presents evidence to show that the 
Sheer number of pleasant words is much 
Sreater than that of unpleasant words. 

‘Ostman says, “Social control over en- 
Vironmental stimuli will, then, tend to 
Create a correlation between frequency 
and positive value” (p. 68). Whether 
Called values, directive states, or affec- 
tive qualities, certain emotive or affec- 
tive aspects of words influence their 
enera] (e.g., Thorndike-Lorge) fre- 
quency, Affective quality determines 
Tequency, not merely in an idiosyn- 
cratic way as Solomon and Howes 
(1951) suggest, but also in a very 
8eneral sense for all words used by all 
individuals. 

It has-been hypothesized from this 
Second position that values can influ- 
Nee visual duration threshold in two 


ways. The affective qualities of words 
influence visual duration threshold di- 
rectly as in the case of an S’s Allport- 
Vernon score influencing his thresholds 
for infrequent value oriented words 
(Postman & Schneider, 1951). (It has 
been shown that the S who values an 
area such as aesthetics highly reports 
infrequent “aesthetic” words at lower 
thresholds than an individual who does 
not value aesthetics highly. However, 
as Solomon and Howes point out, this 
may be due to “idiosyncratic” differ- 
ences in frequency of exposure.) The 
affective qualities of words influence 
visual duration threshold indirectly, 
since the frequency of general usage of 
any word is determined to a consider- 
able extent by the affective tone of that 
word, with this frequency then acting 
as a determiner of visual duration 
threshold (Postman, 1953). 

A third position might also be taken 
concerning the relations between word 
frequency, word value, and visual 
duration threshold. Postman (1953) 
touches upon this last possible interpre- 
tation. He says, “Finally, we cannot 
entirely discount the possibility that 
familiarity resulting from frequency 
may be in itself a source of positive 
value” (p. 68). 

It appears, historically, as if the first 
point of view—that emphasizing word 
frequency, playing down the influence 
of value except as it produces idiosyn- 
cratic differences in word frequency— 
has won out. None of the three posi- 
tions described above have been proven 
to be untenable, however. Indeed, we 
have all of the original questions left, 
plus a few new ones as well. The ques- 
tions that we see as being testable 
and requiring further investigation are 


these: 


1. Is word value related, in a general 
sense, to word frequency? All of the three 
positions described above would admit the 
existence of a relation between value and 
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;—they would diverge in opinion 
eae generality of the relation- 
Yi It is true that a knowledge of word 
Pay allows us to predict visual ice 
threshold. But why is the frequent Woe 
frequent? Perhaps we should go one sp 
beyond frequency to determine hike a ted 

attributes of words vary systematically wit 
B ETAR relation between value and 
frequency exists, does it exist because more 
positively valued words are used more fre- 
quently or because one increases positive 
affect merely by increasing frequency? Or 
does the dependent-independent relationship 
depend on the experimental procedure used? 
3. If value and frequency are related ina 
general, not merely an idio: 
then one would have to s 
influences of each on visual duration thresh- 
old. Are there significant differences in yis- 
ual duration threshold between “good” and 
“bad” words of equal Population-wide fre- 
quency? Are there significant differences 
between frequent and infrequent words equal 
in affective tone? 


syncratic sense, 
eparate out the 


These are the three Problems that 
we have attempted to deal with in the 
series of experiments reported below, 

Our first concern was to determine 
whether a general relation exists be- 
tween word value and word frequency, 
As we noted above, there is Teason to 
suspect that emotional factors do op- 
erate in the building of word frequenc 
In a large sample of words for which 
we have actual measures of goodness 
the words in the Semantic atlas mi 
pared by Jenkins, Russell, and Si i 
(1958), one finds better than a two ts 
one preponderance of 800d over he 
words. While the words in this atl 
are not a random selection of words i 
the English language, the soure : 
which the words were selecte 
unlikely to be systematically þi 
frequency of pleasant 

pleasant words. 

that this ratio is 
entire English | 
1g Proportions 
ant words in 
selected 


€s from 
d seem 
ased in 
as opposed to un- 
One is led to believe 
characteristic of the 
anguage when examin- 
of pleasant to unpleas- 
a sample of 150 words 
at random from the Thorndike. 
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Lorge tables for use in the experiments 
in Series I, discussed below. Of these 
150 words, 95 were on the good end 
(4.00 or less) of the semantic differen- 
tial, a ratio of approximately 2:1. It 
seems highly probable that the sheer 
number of pleasant words in the Eng- 
lish language is far greater than the 
number of unpleasant words. 

Tt seems equally likely that the fre- 
quency with which any specific word 
is used also depends to a considerable 
extent on its goodness or badness: 
hence on its reinforcing qualities. This 
question is amenable to determination. 
As Solomon and Howes (1951) state» 
“In the absence of further data it seem 
best merely to define word frequency 
for English words as the frequency 0! 
words in the Thorndike-Lorge tables 
(pp. 264-265). If affective factors OP 
erate in the building of word freq 
then Pleasantly toned words shou 3 
have higher frequencies in the peer 
dike-Lorge word count than unpleas” 
antly toned words. Experiment I Wê 


A is was 
€signed to determine whether this 
the case, 


Experiment I 


rine 
€ attempted to deter in 
Pleasantness of wor a 
this case Measured by the 800% as 
Scale of the semantic differential" ed 
related to word frequency as meas 
by the Thorndike-Lorge tables. 
$ Selected a sample of word gam” 
the horndike-Lorge tables. THe rds 
Ple of words consisted of two ‘ever 
Selected at random from within in th? 
alphabetic category (except n the 
tables, These words were rate nanti® 
§00d—had dimension of the Er tak 
differentia] by 24 Ss, all freshme urs 
Ing an introductory psychology een the 
€ Tank order correlation bem eae X 
Seodness and the L count ned i ran 
"ese words was +.63, Anoth 
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dom sample of 50 words was taken 
from the Thorndike-Lorge word list in 
the manner described above. These 
words were rated by 28 Ss, a different 
group than those who made the first set 
of ratings. The rank order correlation 
between goodness and L count fre- 
quency in this list was +.40, again sig- 
nificant to the .01 level of confidence. 
A third random sample of 50 words 
was drawn from the Thorndike-Lorge 
tables. These words were rated by 24 
Ss, none of whom had taken part in 
previous experiments. The correlation 
between L count frequency and good- 
ness for this set of words was +.38. 
All of these three correlations are sig- 
nificant to the .01 level. The results of 
these experiments, regardless of differ- 
ences in the magnitude of the correla- 
tions, indicate that there is a positive 
and significant correlation between 
word frequency and goodness. Value 
attributes of words are related to word 
frequency—and not merely in an idio- 
Syneratic way, but in a far more gen- 
eral sense as well. 


EXPERIMENT II 


Solomon and Howes (1951) list 30 
frequent and 30 infrequent words that 
they used in a study of the relation 
of word frequency to visual duration 
threshold. Five frequent and 5 infre- 
quent words had to do with each of the 
Six value areas of the Allport-Vernon 
Study of Values. [Frequent and infre- 
quent words were sometimes, but not 
always, synonymous. We paired these 
Words, matching the first frequent with 
the first infrequent word, and so on, 
through the two lists. Position of pres- 
€ntation within pairs was determined 
PY coin flip. These 30 pairs of words 
were presented to 34 freshman intro- 
ductory psychology students, none of 
whom had taken part in any previous 
Psychological experiments. Ss were 
told to “encircle the most pleasantly 


toned word of each pair.” In 26 of the 
word pairs, the more frequent word 
was chosen by the majority of the Ss as 
the most pleasantly toned. (The 4 
exceptions, in which the least frequent 
word was chosen as the more pleasantly 
toned, were the word pairs limousine- 
automobile, economics-assets, orchestra- 
ensemble, and celestial-heavenly.) A 
sign test (Guilford, 1956, p. 248) indi- 
cates that this preference for the more 
frequent word in each pair could occur 
by chance less than one time in a hun- 
dred. Again, the more frequent word 
is rated as the better word. 

In the experiment discussed below 
we attempted to determine whether fre- 
quency and goodness were related when 
nonsense syllables were used as stim- 
ulus materials. 


EXPERIMENT III 


Twenty-two Ss rated two separate 
visually represented lists of nonsense 
syllables for goodness, as measured by 
the semantic differential. An interval 
of one week separated the ratings of the 
two lists. Each list consisted of eight 
100%, eight 47-53%, and eight 0% 
association value (Glaze, 1928) non- 
sense syllables in random order. Syl- 
lables were selected at random from 
within each block of ten Glaze syllables 
as listed in the Handbook of Experi- 
mental Psychology (Stevens, 1951, pp- 
540-546) at each of the association 
values used. Then the eight syllables 
used in each list at each association 
level were drawn from this pool of 
randomly selected syllables. We be- 
lieved that if frequency of exposure was 
related to goodness, then the higher the 
association value of the syllable, the 
better the syllable should be rated. This 
seemed likely, since the association 
value of nonsense syllables appears to 
be determined largely by the frequency 
of occurrence of the letter combina- 
tions in meaningful words ( Underwood 
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1959). Table 1, showing two sets of 
syllables and their mean ratings of 
goodness, indicates this to be the case. 
A Mann-Whitney (1947) test of the 
significance of differences between syl- 
lables of 100% and 47-53% association 
values in List 1 indicates that the 100% 
list is rated as somewhat better (P 
<.10). The difference in goodness 
between 47-53% and 0% syllables is 
significant (P < .01); Mann-Whitney 
tests of List 2 show the 100% and the 
47-53% list to differ in goodness (P 
< .01); and the 47-53% list and the 
0% list to differ significantly (P 
< .05) as well. 
Just as the goodness and the fre- 
quency of meaningful words in the 
English language are related to one 
another, so also is goodness related to 
the association value of nonsense syl- 
lables ; the higher the association value, 
the better the syllable, 
Our data indicate that value and fre- 
quency are related. We have moved at 


TABLE 1 
RATINGS oF NONSENSE SYLLABLES oF DIr- 
FERING ASSOCIATION VALUES ON THE 
Goop-Bap DIMENSION oF THE 
SEMANTIC DIFFERENTIAL 


syllables] Rating Syilabigs| Rating Syltbies {Rating 

List 1 
BUL | 4.72 459 | zeo |491 
FES | 3.41 4.33 | wun 4.48 
HON | 2.11 4.33 | x1w 5.36 
JIN | 2.98 4.00 wug | 5.26 
RAC | 3.98 3.46 | Qm 4.87 
Rov | 3.74 2.93 | Guy 4.39 
SUR | 2.72 3.52 | vec 3.91 
WIL | 2.46 4.30 | xur 5.28 

List 2 
DUL | 4.27 | nek |346 | yur | 4.46 
BAL | 2.35 | piy | 5.17 | gir | 4.44 
Lov | 1.44 | wep | 4.71 | zux 5.04 
SAR | 3.23 | RIQ 4.46 | zr 4.06 
LIK | 2.50 | var | 3.96 | xap |485 
HAV | 2.35 | Lug 4.15 | xuc | 5.35 
WAT | 3.46 | Gow | 3.75 QUJ 5.00 
ROV | 3.54 | ruv | 3.83 GUQ | 5.98 
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least somewhat beyond the empirical 
fact that word frequency is related to 
visual duration threshold. We know 
something else about the stimulus qual- 
ities of frequent as opposed to infre- 
quent words; frequent words are not 
only more frequent but are better as 
well. Our second question then be- 
comes relevant. Does a manipulation 
of frequency cause systematic variation 
in value or do differences in value pro- 
duce differences in word frequency, or 
does the independent-dependent status 
of each of these variables vary depend- 
ing on the experimental procedures 


used? Experiment IV bears upon this 
question, 


Exvertment IV 


Fourteen Ss rated the goodness of 
twenty nonsense words. These words 
were from the list of words used by 
Solomon and Postman (1952) in their 
study of the effect of “built in” fre- 
quency on visual duration threshold. 
The words used Were: JANDARA, AF- 
WORBU, BIWOJNI, NANSOMA, ENANWAL, 
IKTITAK, SARICIK, ZABULON, CIVADRA, 
LOKANTA, KADIRGA, ADAFNAW, BOR- 
ULCE, NIJARON, ENSHIMI, INDULAM, 
TAVHANE, UDIBNON, DILIKLI, and MEC- 
BURI. Mean goodness scores were ob- 
tained. One week later each S was 
tested individually. When the S en- 
tered the experimental room he was 
given the following instructions (from 
Solomon & Postman, 1952) : 


This is an experiment concerning the effec- 
tiveness of Tepetition in learning to pro- 
ounce strange words correctly. It has 4 
direct bearing on the problem of reading 
Words in a foreign language, as compared to 
hearing the words spoken. In addition, we 
are interested in knowing whether the rela- 
tive effectiveness of the two kinds of learning 
methods depends on general reading ability: 

© are going to give you a deck of cae 

n each card is printed a strange word. W 
would like you to look at each card careful 7 
and then pronounce the word in the way ? 
Would be pronounced if it were a word in the 


=, 
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English language. Proceed steadily from 
card to card, turning over each one after you 
have finished with it. Go right through the 
deck and then stop unless you have serious 
doubts about your pronunciations. 


The experimenter (E) handed the S a 
pack of 90 3” X 5” cards. In this pack 
were the words listed above. The first 
five words above were presented 10 
times apiece, the next five, 5 times 
apiece, the next five, 2 times apiece, 
and the last five, once each. The cards 
were shuffled thoroughly after each S 
completed this task, so that they were 
in a relatively random order. 

All responses were taped in order to 
lend realism to the expressed purpose 
of this new situation. Since Ss had 
participated in another experiment on 
the pronounceability of words, it seems 
likely that (although the Ss had seen 
the words before while rating them on 
the semantic differential) they were 
misled as to the purpose of this ex- 
Periment. Each S rated the nonsense 
Words a second time following this pro- 
Nounceability test. 

Figure 1 indicates the mean pre- 
€xposure and postexposture ratings for 
Words of each frequency of exposure. 
While the differences between ratings 
Prior to exposure were not significant, 
a Kruskal and Wallis H test (1952) 
Shows that words of differing frequen- 
cles of exposure do differ significantly 
in the postexposure test (P < .01, H 
= 7.83). 

_In this experimental situation a ma- 
Nipulation of word frequency alters Ss’ 
evaluations of word goodness. One 
Can, however, think of instances where 

€ reverse might well occur. The most 
bvious example of this is in verbal 
Conditioning, where the differential re- 
Inforcement of a class of words pro- 
duces systematic variation in the fre- 
Weney with which this class of words 
'S emitted. We would suspect that in 
this case frequency of emission (and 
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Fic. 1. Ratings of nonsense words before 
and after word frequency was manipulated. 


hence of reinforcement) alters the af- 
fective quality of the words. Reinforce- 
ment might then produce further varia- 
tion of frequency. 

No matter whether frequency of ex- 
posure alters value or vice versa, or 
both, one thing is evident—there is a 
relation between word frequency and 
Ss’ ratings of word goodness. Since 
this is so, our third question also be- 
comes relevant. Since value and fre- 
quency are generally related, it became 
necessary to experiment with samples 
of words matched in frequency, varying 
in goodness, plus samples of words 
varying in frequency, matched in good- 
ness, in order to separate out the influ- 
ence of value and of frequency on 
visual duration threshold. This was 
attempted in Experiment V. 


EXPERIMENT V 


We had various groups of Ss rate 
the goodness of a large number of 
words on the semantic differential. 
These groups rated 60 words of a 
Thorndike-Lorge G count of 12, 13, or 
14, three random samples of 50 words 
each (described in Experiment I), and 
39 words, one sample of 30 words plus 
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TABLE 2 
WORDS USED IN THE TACHISTOSCOPIC EXPERIMENT 

Words of Matched Goodness Words of Matched Frequency 

ei Bad 

Frequent Infrequent Good a 

5j Thresh- 

; Thresh- SD |Thresh-| y G |Thresh-| y „G 
Word Reng old Word Rating} old Word Count} old Word | Count] old 
Sampie 1 osoz [income | 2.52 | osoo | Rany | 12 | 1092 Stammer| 12 | .1538 
Buter | 17 | ‘besa | encom ieg | 4985 | Glisten | 13 | (0777 | Comune) 13 ate 
Doctor 1,60 | .0369 | Ivory 1.68 | .0262 | Prosper | 13 1046 | Defect 14 "0969 
Health 1.97 | .0492 | Piano 1.97 | .0546 | Ivory 13 -0262 | Hinder 13 “1285 
Sleep 2.22 -0362 | Radiant! 2/08 0662 | Ginger 13 0446 | Wretch 12 0785 
Sweet 1.93 -0631 | Patriot 1.90 0992 | Alert 14 -0392 | Sneer 12 1123 
Mother 173 | 0315 |Vision 1.57 | .0546 | Deputy | 13 «0446 | Scandal | 13 1162 
House 246 | .0323 | Agile 2.74 | .0623 | Vehicle 13 +0492 | Dismal | 13 0469 

Flower 1.87 | .0369 | Bible 1.87 | .0462 | Caress | 13 -1085 | Penalty | 13 
Sample 2 7 0713 

Mountain] 2.73 | 0687 | Bodice 2.74 | 0965 | Bodi 2 . Slime 4 < 
Water | 243 | “9682 Dough | 2.47 | ‘0757 | Canas | 32 0409 | Beugar | 29 a 
Window | 2.68 | “0400 | Gradual] 2:42 0965 | Engine | “A | ‘oses | Bese A | di7 
Green 2.93 | 10296 3.00 | 10852 | Brave A | .0739 | Anger A 10948 
Smooth 2.57 | 0522 2.62 | .0330 | Statue 32 -0748 | Divorce | 29 "1048 
Father 2.36 0800 7 2.49 0800 | Cushion 21 -0900 | Inferior 19 “0687 
Pretty | 213 | ‘0661 | Re reali | 2.09 | 0661 | Garment| 2 11043 | Grief 45. | 068! 
Sires’ | 2:77 | iois |Lentent | 32° 1522 | Church | AA | ‘0530 | Bacber | Aa | i0 


9 words which happened to 
the other lists, f 
atlas. From 

Jenkins’ atlas, 
words matched 
in goodness, 


appear on 
rom Jenkins’ semantic 
these words, and from 
We selected 17 pairs of 
l in frequency, varying 
and 17 pairs of words 
matched in goodness, varying in fre- 
quency. (Some words among those 
matched in goodness were rated by 
Jenkins’ Ss and by our own, Although 
the rank order correlation of goodness 
ratings between our own and Jenkins’ 
sample of Ss was +.98, for a 


group of 
30 of these words, there was some 
variation in ratings between the two 


samples. Whenever this occurred, we 
used the mean goodness rating assigned 
by our Ss.) The first 9 words from each 
list were presented to the first group of 
26 Ss; the last 8 to a second group of 
23 Ss. The four lists of words, divided 
as they were presented to two sample 
populations, are presented in Table 2. 
The words were put on slides in the 
form of bold faced capital letters. The 
words were presented to Ss by means 
of a Revere 888 tachistoscope. Each 5 


sat 18 feet from the screen on which 
the words were Projected. The testing 
room was fully illuminated while me 
tachistoscope was used at the smalle 
Possible lens opening. The ratio O 
room illumination to screen illumina- 
tion was 1 :1,08, The words were pre- 
Sented in random order with the order 
remaining constant in all presentations 
to any specific S but with the oran 
varying between Ss. A modifiec 
method of limits was used in which 
each of the 36: words was presente? 
once at each of the following speeds: 
1/100, 1/50, 1/25, 1/5, 1/2, and 1 sec- 
ond. Under the conditions of praen 
tation used in this experiment, only 7 
Word was recognized at 1/100 a 
Second while all words were recognized 
at 1 second, 

Each S’s mean visual duration thresh- 
old for frequent as opposed to infre 
quent and good as opposed to si 
words was obtained, The Ss in ihe 
first sample (N = 26), exposed to i 

Tst group of words, differed ngm 
cantly in mean thresholds for good V=: 
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bad words (¢ = 3.56, P < .01, using 
the formula for correlated data), with 
good words reported at lower thresh- 
olds. These Ss also differed in mean 
thresholds for frequent vs. infrequent 
words (t= 2.33, P < .05), with fre- 
quent words reported at lower thresh- 
olds. The Ss in the second sample (N 
= 23), exposed to the second group of 
words, differed significantly in mean 
thresholds for good vs. bad words (t 
= 3.25, P < .01), with good words re- 
ported at lower thresholds, and in 
threshold for frequent vs. infrequent 
words (¿= 3.82, P < .01), with fre- 
quent words reported at lower thresh- 
olds. 


Discussion 


We attempted to obtain data relevant 
to three specific questions, Let us look 
at these questions, along with the rele- 
vant data. 


1, Is value related, in a general sense, to 
Word frequency? Our data is unequivocally 
affirmative. Whether dealing with samples 
of words in the English language, with fre- 
quent vs. infrequent words in various All- 
Port-Vernon value areas, or with nonsense 
Syllables, frequency, as measured by the 
Thorndike-Lorge tables and by association 
value, and positive value, as measured by the 
good-bad scale of the semantic differential, 
are significantly related in a general sense. _ 

2. If this general relation exists, does it 
exist because more positively valued words 
are used more frequently or because one in- 
creases positive value merely by increasing 
frequency ? We found that a manipulation 
of frequency produces systematic variation m 
Ss’ ratings of the goodness of words. It 
Seems highly probable, however, that the 
Teverse may also be true—that by increasing 
the goodness of words, one also increases 
equency. At this point it seems likely that 

le independent-dependent relation between 
Zoodness and frequency would depend on the 
Experimental situation. We would not haz- 
ard a gu s to the effect of each on the 
other in general speech, although a relation 
tween frequency and goodness has been 
Shown to exist. 

The data gathered in the first three 
experiments suggest that other attrib- 
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utes of the stimulus vary systematically 
as one varies frequency. Perhaps this 
is true for other forms of learning also. 
The fourth experiment would suggest 
the possibility that a manipulation of 
the frequency of occurrence of a stim- 
ulus and a contiguous response, in it- 
self and without any regard to drive 
reduction, changes the reinforcing qual- 
ities of the stimulus. Perhaps we 
should look beyond the influence of fre- 
quency on response probability and ask 
why frequency changes response prob- 
ability. Is it the difference in associa- 
tion value (and hence, in frequency of 
occurrence) that causes 100% associa- 
tion value syllables to be learned more 
rapidly than 0% syllables? Or is it the 
variation in affective quality accom- 
panying differing frequencies of occur- 
rence? Since the relation between 
goodness and frequency is not perfect, 
one can obtain syllables that vary from 
0 to 100% in association value, yet are 
equal in semantic rating—this should 
provide a test for our question. 

These results seem to have consider- 
able social consequence. If word fre- 
quency and word goodness are related, 
and if a manipulation of the frequency 
of occurrence of a word produces sys- 
tematic variation in word value, then 
one wonders whether ideas and per- 
sonal values are manipulable in the 
same fashion. If so, then perhaps those 
elements of society that believe in cen- 
sorship would find support in these ex- 
periments for one of their major as- 
sumptions, viz., that frequent exposure 
to an idea or‘a set of behaviors—vio- 
lence, drinking alcoholic beverages, or 
what have you—lessens the “evil” asso- 
ciated with the idea of behavior, The 
validity of their second major assump- 
tion—that if we make a thing evil 
enough, people won't do this thing—is, 
of course, a very different problem. 

If the more frequent becomes dem- 


onstrably better merely because it is 
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more frequent, one can well understand 
the values of advertising. If it is gen- 
erally true that the most frequent words 
in the English language are also the 
most good, the logical conclusion that 
this would lead to is that ads would be 
more effective if written at an even 
lower vocabulary level than is currently 
used. “Oh, brave new world. . , .” 


3. If value and frequency are related, what 
are the influences of each on visual duration 
threshold when the other variable is held 
constant? Words equal in frequency, vary- 
ing in value, are correctly reported by Ss at 
different mean thresholds, with good words 
being correctly identified at significantly 
lower thresholds than are bad words. Words 
equal in goodness, varying in frequency, are 
reported by Ss at different mean thresholds, 
with frequent words being correctly identi- 
fied at significantly lower thresholds. 


While it has been generally accepted 
that values have Some secondary effect 
on visual duration thresholds, it is also 
generally accepted that the influence of 
values on threshold is small, producing 
idiosyncratic variations in frequency 
and hence influencing thresholds in this 
way. Our results indicate that values 
influence threshold even when fre- 
quency of exposure js held constant, 
and that this influence is approximately 
equal to that of frequency with value 


held constant, using these lists of 
words. 


Since word goodness, 
from frequency, does significantly influ- 
ence visual duration threshold, we are 
left with the problem of “Why?” The 
terms “perceptual selectivity” and “per- 
ceptual defense” are descriptive, not 
explanatory. We have „0O answers 
here, although we would like to discuss 
several possible explanations. 

One interpretation of the fact that 
good words are perceived at lower 
threshold than bad words of the same 
frequency might be called the “simple” 
guess theory. Good words appear to 
he about twice as common as bad words 


independent 


in the English language. Specific good 
words are likely to be more frequently 
used than specific bad words. It seems, 
then, that good words occur between 
three and four times as frequently as 
bad words in terms of usage. Even 
though the frequency of occurrence of 
the stimuli are equaled, by matching 
good and bad words for Thorndike- 
Lorge frequency, response probabilities 
probably are still not equal. If an S 
sees a word in a tachistoscope but iden- 
tifies only a “g— qd ” he is prob- 
ably three or four times more likely, on 
the basis of responses made in the past, 
to fill in the gaps with a good word, 
such as “gradual,” rather than a bad 
word, such as “goudge.” This simple 
guess explanation may he partially cor- 
rect but is not too satisfying to us, since 
an examination of our Ss’ responses to 
words on the good-bad lists indicates 
that there seems to be a rather strong 
likelihood that when Ss guess at but 
miss a bad word, they respond with an- 
other bad word—incorrect but still bad, 
while when they respond incorrectly to 
the tachistoscope Presentation of a good 
word, the incorrect response is usually 
another good word. This finding runs 
Contrary to the idea that perceptual de- 
fense is a Significant variable but also 
suggests that something more systemi- 
atic than mere guessing is involved in 
errors and in correct responses to tach- 
istoscopically presented words. ) 
second explanation for the dif- 
ferences in Sy’ mean visual duration 
thresholds might be called the “com- 
Plex” guess theory. This explanation 
is derived from Bruner and Postman’s 
(1949) hypothesis theory of perception 
and cognition, According to Postman 
(1951) hypotheses are, “in the most 
Seneral sense, expectancies or predis- 
positions of the organism which serve 
to select, organize, and transform the 
stimulus information that comes from 
the environment” (p. 249), The 
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strength or availability of certain hy- 
potheses is stronger than that of others. 
This strength is a function of the fol- 
lowing factors: frequency of past con- 
firmation of the hypothesis, number of 
alternative hypotheses, motivational rel- 
evance, and cognitive support. The 
first two factors—frequency of past 
Confirmation and number of alternative 
hypotheses—fit well into the simple 
Suess theory, while the last factor, that 
of cognitive support, may not be rel- 
evant to this discussion. The factor of 
motivational relevance might possibly, 
however, produce differential predic- 
tions between the two explanatory de- 
vices that we have labeled simple 
and complex guessing. As Postman 
(1951) puts it: “The stronger the mo- 
tivational support for a given hypothe- 
sis, the less of appropriate information 
is required to confirm it” (p. 255). 
We would limit this statement some- 
what by tying it only to the effect of 
Teward or reinforcement on the number 
of cues required for the confirmation of 
a hypothesis. It may be that the dif- 
ferential reinforcement of various stim- 
uli causes those stimuli (as well as those 
Tesponses) that have most frequently 
led to reinforcement in the past to be 
more readily observable or available in 
the present. The availability of a spe- 
Cific response may well depend on the 
frequency that this response has been 
Tewarded in the past—but perhaps be- 
Cause that aspect of the stimulus situa- 
tion that has called forth the rewarded 
Tesponse in the past has become more 
Salient—has become more of a “figure” 
aS opposed to the other stimuli becom- 
ing more like the “ground.” To para- 
Phrase our quotation from Postman 
given above: The more reinforcing or 
rewarding a word is, the less cues are 
Needed for the recognition of this word. 

© are advancing the proposition that 
reinforcement not only influences the 
Probability that a given response will be 
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emitted, but also that reinforcement 
causes that aspect of the stimulus situa- 
tion that is salient at the time that the 
rewarded response is made to become 
more likely to be salient at the next 
presentation of the stimulus. If this is 
true, then even if word frequencies of 
good and bad words are equal and even 
if habitual modes of response (such as 
guessing good words) are controlled, 
the stimulus qualities of rewarding 
or reinforcing words would be such 
that less information is needed before 
the word is correctly recognized than 
would be the case for words which 
were presented with equal frequency 
but were not equally reinforcing. Some- 
thing that might be called perceptual 
sensitivity might then be said to exist. 
This sensitivity would not, however, be 
a unique perceptual process, but would 
instead depend on those old standbys in 
all forms of learning: frequency and 
reinforcement, a little intermingled to 
be sure, but still observably present. 

We have no way of testing out the 
“why” question with these data, al- 
though the types of wrong guesses 
made in our study incline us away from 
the simple guess theory. We believe 
that the data presented in this study are 
valuable chiefly in reviving the concept 
of perceptual selectivity, not in solving 
the problem of why what is called per- 
ceptual selectivity exists. By system- 
atically varying frequency of exposure 
and frequency of reinforcement and 
nonreinforcement of nonsense syllables 
in future experiments, we hope to even- 
tually be able to deal with the question 
of why good words are perceived at 
lower mean thresholds than are bad 
words. 


SUMMARY 


Three positions may be taken con- 
cerning the relation between word value 
and word frequency, and the relation of 
each of these to visual duration thresh- 
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old. Two of these points of view would. 


suggest a general relation to exist ue 
tween word frequency and word valué 
Three experiments demonstrate the ex- 
istence of this relationship. The more 
frequent a word or a nonsense syllable 
occurs in the English language, the 
better it is likely to be rated on the 
good-bad scale of the semantic differen- 
tal Since this relation does exist, we 
attempted to test out, in Experiment 
IV, one possible explanation of why the 
relation exists. A manipulation of the 
frequency of nonsense words produces 
systematic variation in the rated good- 
ness of these nonsense words. Words 


that are frequent are also very likely to 
be rated as good, 


out the influence of 
of word freq 
threshold, 


; matched 
frequency, 


quency and value appear to 
producing differentia] Visua 
thresholds for words, 

planations and certain jn 
these findings were discus 


operate in 
l duration 
Possible ex- 
plications of 
sed. 
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Rating scales have been so widely 
used in psychology, both as research 
tools and in practical applications, that 
the determination of the optimum num- 
ber of rating categories has been an im- 
portant research problem in itself. The 
usual attack on this problem has been 
to determine the effect of number of 
rating categories on interrater reliabil- 
ity, and research on these lines has been 
well summarized by Guilford (1954, p. 
289 ff.). 

The purpose of a rating scale is to 
allow a rater or group of raters to dem- 
Onstrate perceptual discrimination of a 
Set of stimuli or objects to be rated. In 
Other words, our concern is not with 
the rating scale per se, but rather with 


` the extent to which different rating 


Scales allow us to measure discrimina- 
tion between the stimuli rated. And 
the basic question about number of rat- 
Ing categories is simply whether for any 
81ven rating problem there is an opti- 


l This research was done in part under 
Contract N5-ori-166, Task Order 1, and in 
Part under Contract Nonr-248(55), between 
the Office of Naval Research and Johns 
Topking University. This is Report No. 6 
Under the latter contract. Reproduction in 
Whole or in part is permitted for any purpose 
Of the United States Government. Apprecia- 
tion is due to Robert Austin, Allen Madans, 
and Natalia Potanin for aid in various cal- 
culations, 


mum number of rating categories, or at 
least a number of rating categories be- 
yond which there is no further im- 
provement in discrimination of the 
rated objects. 

Techniques directly concerned with 
discriminability of stimuli have rarely 
been used for this problem, and yet 
since the problem basically is one of 
stimulus discrimination, it seems most 
appropriate that they should be. 

Two techniques which have been suc- 
cessful in dealing with related problems 
are the use of information transmission 
measures and the use of discriminabil- 
ity scaling procedures. Garner and 
Hake (1951) described how informa- 
tion transmission measures can be used 
with the absolute judgment problem to 
determine the effective discrimination 
between the stimuli of a set. While 
there are ramifications of the absolute 
judgment procedure which are differ- 
ent from those of the rating scale situa- 
tion, the problems are sufficiently alike 
that we can expect information trans- 
mission measures to be useful. Bendig 
and Hughes (1953) and Bendig (1954) 
have in fact used the information par- 
adigm for the rating scale problem, and 
came to the conclusion that discrimina- 
tion increases with an increase in num- 
ber of rating categories up to at least 


11 categories. 
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In the same paper, Garner and Hake 
(1951) described a discriminability 
scaling procedure and pointed out the 
relatedness of this procedure to the use 
of information transmission measures. 
To our knowledge, however, discrimi- 
nability scaling procedures have not 
been used in determining the optimum 
number of rating categories, ; 

The purpose of the present paper is 
to show how both information measures 
and discriminability scaling procedures 
can be used to aid in the solution of the 
rating scale problem. In addition, it 
will be shown that these two proce- 
dures are quite closely related, and that 
measures from one can be used to esti- 
mate the other. An illustrative set of 


data will help clarify the nature of these 
relations. 


Tue DATA 


The data used in these experiments 
were ratings of the legibility of hand- 
writing samples. We obtained 20 dif- 
ferent samples of handwriting of “The 
quick brown fox jumped over the 
dogs,” chosen to give us a reas 
spread of legibilities. The samples were 
arranged on a Single mimeographed 
sheet, one to a line. A total of 50 raters 
then rated each sample on a rating scale 
with 4, 6, 8, 10, 12, 16, and 20 cat- 
egories. Each rater rated all Samples 
with each rating scale, using one rating 
scale until all samples were rated on 
that scale. Different Taters used the 
rating scales in different counterbal- 
anced orders, and the samples of hand- 
writing were randomized differently on 
each sheet to minimize any time errors, 

The raters were randomly selected 
from available graduate and undergrad- 
uate students. Since each of 50 raters 
rated each of 20 samples of handwriting 
on each of seven different rating scales, 
we had a total of 7,000 ratings as our 

basic data. 
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INFORMATION TRANSMISSION 


The first analysis was to determine 
the information transmission, U(R:S), 
between stimuli, or samples, and re- 
sponses for each of the rating scales. 
(While we are using the term informa- 
tion transmission, we will use the more 
general notation of Garner and McGill, 
1956, in which U(R:S) isa contingent 
uncertainty between responses and stim- 
uli.) All 50 ratings for each sample 
were pooled for this analysis. Since 
U(R:S) is limited by the number of 
response categories, the various rating 
scales allowed maximum information 
transmissions from 2.00 bits for four 
categories to 4.32 bits for 20 categories. 
The open squares of Figure 1 are 
the information transmission measures 
computed from the basic data, and these 
data show a Consistent rise in informa- 
tion transmission as the number of rat- 
ing categories is increased. This result 
is in agreement with the results of 
Bendig and Hughes (1953) and Ben- 
dig (1954). These calculations, how- 
ever, are subject to a bias in the infor- 
mation measure which changes progres- 
sively with number of rating categories. 
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ures of information transmission. The ue 
circles Tepresent the same data corrected ag 
bias by the Miller-Madow correction. poe 
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Sent estimates of information transmis! 
obtained from discriminability functions.) 
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Each matrix has the same total number 
of ratings, but when the number of rat- 
ing categories is increased, these rat- 
ings are distributed more thinly over 
the larger number of cells in the matrix. 
Miller (1955) has described a correc- 
tion for this bias which is known as the 
Miller-Madow correction, and had him- 
Self suggested (Quastler, 1955, fn., p. 
16) that Bendig’s results would have 
been different if the correction for bias 
had been applied. 

The correction for bias of informa- 
tion transmission is always in the direc- 
tion of decreasing the estimate of trans- 
mission, and the fewer the number of 
cases per cell, the greater the correc- 
tion; i.e., the greater does the directly 
calculated information transmission ex- 
ceed the true value. When the Miller- 
Madow correction is applied, the result 
is that shown by the open circles of 
Figure 1. This result, which presents 
More nearly the true picture, shows 
Considerably less effect of number of 
Tating categories than does the result 
using raw measures. The correction 
flattens out the function. However, it 
is clear that there still is some effect 
of number of rating categories, and that 
an increase in the number of categories 
Mcreases information transmission even 
Up to 20 categories. 

The two lines drawn in Figure 1 are 
linear functions, but they should be 
Considered as entirely empirical de- 
Scription, since logically the functions 
Cannot be linear much beyond the range 
of conditions investigated. For ex- 
ample, if only one rating category is 
Used, there can be no information trans- 
Mission, but the functions drawn do 
Not extrapolate to zero with one rating 
Category, Somewhere below four cat- 
©8ories they must slope rapidly down- 
Wards, 

On the other hand, there cannot be 

Meaningful increase in information 
Tansmission if more than 20 rating 


categories are used. With 20 or more 
rating categories, each rater can put 
each stimulus in a different category, 
and the rater can evidence no more 
discrimination than to rate each stim- 
ulus differently. And clearly the de- 
gree of discrimination by the group is 
limited by the discrimination of each 
rater. Actually the directly computed 
information transmission can increase 
with an increase in number of rating 
categories until each rater-sample com- 
bination produces a different rating. 
The function corrected for bias, how- 
ever, would not increase concomitantly, 
since the bias correction would increase 
at approximately the same rate as the 
increase in number of rating categories. 
In the limit, the information transmis- 
sion corrected for bias becomes degen- 
erate, since with a large enough number 
of categories and a small enough num- 
ber of ratings, it is impossible to dem- 
onstrate that the measured information 
transmission is different from the value 
expected on the assumption of no true 


contingency. 
DISCRIMINABILITY FUNCTIONS 


The second method of determining 
the amount of discrimination in the 
ratings is to construct discriminability 
scales for the set of stimuli. The dis- 
persion of the scale values is then a 
measure of the relative discrimination 
between the various stimuli. We have 
carried out the necessary scaling pro- 
cedures for these data in two different 
ways. 


Pooled Data 

With the first method, the data 
were kept in the pooled form in which 
they were used to compute information 
transmission. The S-R matrix of fre- 
quencies was converted into a set of 
equivalent pair comparisons with the 
technique described by Guilford (1938). 
The essence of this procedure is that 
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the distribution of responses for each 
stimulus is compared category by cat- 
egory with the distributions for each of 
the other stimuli to obtain a measure of 
the equivalent number of times that 
each stimulus would have been tated 
greater than each other stimulus, as- 
suming the ratings to be uncorrelated. 
From the matrix of equivalent pair 
comparisons, a discriminability scale 
was then constructed following the 
usual procedure for pair comparisons. 
One such discriminability scale was 
constructed for each rating scale, 


Individual Data 


With the second met 
were changed to equiv. 
parisons one rater at 
each rater’s ratings of the 20 stimuli 
on one rating scale were changed into 
equivalent pair judgments in much the 
same way as Guilford (1954, p. 183 
ff.) describes for converting ranks. 
Ties were used where necessary. These 
individual pair comparisons were then 
pooled for all 50 raters, and a dis- 
criminability scale constructed—again 
one for each tating scale. We thus 
have 14 discriminability scales, one for 


each of seven rating scales with each 
method, 


The purpose in carrying out the two 
kinds of analysis was to determine 
whether the pooling of the data for the 
informational analysis had had any sys- 
tematic effect on the fy 
information transmissio 
rating categories. With a complete 
multivariate analysis of data such as 
these, we can determine Separately the 
extent to which responses can be pre- 
dicted from samples, from raters, and 
from the interaction between samples 
and raters. With only one rating per 
sample per rater, however, we have no 
choice but to pool the data, and are in 
effect accepting the interaction term as 

an error term rather than as a predict- 


hod, the ratings 
alent pair com- 
a time. Thus 


nction relating 
n to number of 
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ability term. We are in effect also 
accepting the contingent uncertainty be- 
tween responses and raters as an error 
term. This last factor, which indicates 
the extent to which different raters 
use different responses on the average, 
could change with an increase in num- 
ber of rating categories. For example, 
if all raters used rankings when 20 cat- 
egories were allowed, then there could 
be no difference in use of responses by 
different raters, but such differences 
could exist with smaller members of 
categories. Thus, although it is not 
feasible to measure such a term di- 
rectly, it is possible that the effect 
of this factor systematically decreased 
with the increase in number of rating 
categories, with the resultant increase 
in the information transmission which 
was obtained, 

The first discriminability scale anal- 
ysis used the pooled ratings to convert 
to equivalent pair comparisons. Thus 
any systematic differences between rat- 
ers in ratings would still affect the final 
Scale values. With the second scaling 
Procedure, any such differences were 
removed by changing the data to equiv- 
alent pair comparisons before pooling. 


Correlations of the Scales 


The various scales produced by the 
two different methods are essentially 
identical, When the scale values are 
correlated between the two data pros 
essing methods for each of the seven 
rating scales, the seven correlation CO- 
efficients are all above .99. In addition, 
the correlations between scales il 
different rating methods are quite high, 
as shown in Table 1. We have gmon 
in this table only the correlations fr 
the pooled data. Since all of the em? 
lations between the two scaling metho 
are so high, a table of correlations using 
the individual data is practically iden 
tical to Table i dis- 

While the correlations between 
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TABLE 1 
CORRELATIONS BETWEEN DISCRIMINABILITY 
SCALE VALUES FOR DIFFERENT NUMBERS 
OF RATING CATEGORIES 


Number of Rating Categories 


4 6 8 10 12 16 
6| .90 
8| .90 91 


12| 93 | 197 | i95 | .96 
16| 90 | 96 | 92 | i97 | .97 
20| ‘89 | i94 | 100 | 95 | 96 | o7 


Note.—Pooled data were used for these cor- 
relations. 
criminability scale values obtained from 
different numbers of rating categories 
are all quite high, it is clear that rating 
scales similiar in number of categories 
correlate more highly than do widely 
different scales. Thus the 4 scale and 
the 20 scale have the smallest correla- 
tion, while the 16 scale and the 20 scale 
have the highest correlation, along with 
Some other close pairs. 


Discriminability 
These data on correlations between 
discriminability scale values show only 
that the various scales produced are 
highly predictable, one from the other. 
hey do not show that the total amount 
Of discrimination is the same. In order 
to determine the total amount of dis- 
crimination, the simplest procedure is 
to compute the standard deviations of 
e scale values for each of the scales. 
he scale whose stimulus values are 
most spread apart is the scale with the 
STeatest interstimulus discrimination. 
Onsequently, standard deviations have 
een computed for each of the 14 dis- 
‘timinability scales, and these values 
are plotted in Figure 2. . . 
These data, like the information 
řansmission data, show an increase 1n 
t'Scrimination with an increased num- 
"er of categories on the rating scale. 


And while the individual data show 
slightly greater discrimination than do 
the pooled data, the difference is very 
small—mean standard deviation for the 
pooled data is 0.881 and for the indi- 
vidual data it is 0.897. 

What is more important, however, is 
that there is no evidence that the in- 
crease in discrimination with the in- 
crease in number of rating categories 
is in any way due to the pooling of the 
data. Both sets of points show a def- 
inite increase, and only for the 4 cat- 
egory scale is there any appreciable 
difference between the two standard 
deviations. 

Once again the line drawn is a least- 
squares solution which can be consid- 
ered only an empirical description. If 
one rating category is used, the stand- 
ard deviation must be zero, and the 
curve shown does not extrapolate to 
zero at that point. Likewise, with a 
larger number of rating categories the 
scale values could not increase sub- 
stantially, since no greater precision in 
equivalent pair comparisons could be 
obtained if there are more rating cat- 
egories than there are stimuli. In other 
words, since the discriminability scale 
values are obtained from equivalent 
pair comparisons, the larger range of 
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NUMBER OF RATING CATEGORIES 


Fic. 2. Relation between number of rating 
categories and standard deviation of the dis- 
criminability scale values. (The open circles 
are for the data processed as a group; the 
closed circles for the data tabulated by in- 
dividuals. The line is a least-squares fit.) 
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numbers in no way affects the standard 
deviations of the scale values. 


ESTIMATES OF INFORMATION 
TRANSMISSION 


We have shown that two alternative 
methods of determining the amount of 
discrimination in a set of stimuli eyi- 
denced by a rating procedure lead to 
the same conclusion, namely, that dis- 
crimination increases with an increase 
in number of rating categories, up to 
20 categories in our case, It seems 
very reasonable that these two meas- 
ures should, therefore, be related. It is 
the purpose of this section to show that 
they are related and that one can be 
quite exactly estimated from the other. 

The actual equation for estimating 
transmitted information from the stand- 
ard deviation or variance of the dis- 
criminability scale values is 


estU(R:S) =} log, [V(S) +1] [1] 


where estU(R:S) is the estimated in- 
formation transmission and V(S) is 
the variance of the discriminability scale 
values of the stimuli. The § notation 
emphasizes the fact that these are mean 
scale values. 

For the pooled data shown in Figure 
2, an estimate of the information trans- 
mission has been computed for each 
number of rating categories, and these 
values are shown as the closed circles 
of Figure 1. It is obvious that Equa- 
tion 1 provides a very accurate estimate 
of the unbiased information transmis- 
sion, and that the two measures are 
highly related. ’ 

The rationale of Equation 1 can be 
understood from two related points of 
view. First, we can conceptualize the 
discriminability scale values as a set of 
values for a set of stimuli, and this 
same set of values as the mean numer- 
ical responses which would be obtained 
if both the stimuli and the responses 
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were metricized in the unit normal sys- 
tem used with discriminability scales. 
In other words, we really have an S-R 
matrix, but one in which these new 
scale values, rather than the respective 
original designations, are used for both 
stimuli and responses. Now with such 
a matrix both the stimulus scale and the 
response scale are continuous, but we 
could compute an information transmis- 
sion figure from the matrix by catego- 
rizing the scales. Tt is not necessary, 

owever, to set up such a categorized 
matrix, since uncertainties can be esti- 
mated from variances (Garner & Mc- 
Gill, 1956) and we can easily determine 
the appropriate variances with no need 
to categorize. 

Information transmission can be com- 
puted from the following equation : 


U(R:S) = U(R) — Us(R), [2] 


where U(R) is the response informa- 
tion or uncertainty, and Us(R) is the 
response information conditional on the 


stimulus, or response equivocation, aS 
it is sometimes called. 


estU(R:S) = 4 log, [V(R)] 
—tlog, [Vs(R)] [3] 


In this equation 7(R) is the variance 
of the response continuum, and Vs(R) 
is the average variance of the responses 
taking one stimulus at a time, i.e. the 
within column variance, The two terms 
on the right side of Equation 3 are the 
estimates of the equivalent uncertainties 
of Equation 2, ( Actually, each of these 
terms in Equation 3 should have an 
associated term which reflects the width 
of the category interval, but since these 
terms are the same in both cases, they 
cancel each other—see Garner & Mc- 
Gill, 1956, for further explanation. ) 

Ow we know that the value of 
Va(R) is 1.0; since itis so by definition 
in a unit normalized system ; therefore 
the logarithm of it is zero. 
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On the other hand, 
V(R) =V(R) +1=V(S) +1. [4] 


V (S) is the variance of the actual stim- 
ulus scale values, but since we are as- 
suming a one-to-one relation between 
the responses and the stimuli (except 
for error variation), the mean response 
for each stimulus is the same as the 
stimulus value itself. Thus the var- 
iance of the means of the responses is 
the same as the variance of the stimulus 
scale values. But the total variance of 
the responses, V (R), will be the var- 
iance of the means plus the variance 
within groups (the error variance in 
analysis of variance terms). Since the 
error variance is 1.00, the total variance 
of the responses in this idealized matrix 
will be V (S) + 1. 

Thus the estimated second term on 
the right of Equation 2 is zero, and the 
first term on the right is directly esti- 
mated as shown in Equation 1. 

The other point of view from which 
to understand Equation 1 is by means 
of correlational analysis. Attneave 
(1959) has pointed out the general 
relation that 


€stU(R:S) = — 4 log, (1 — 7°) [5] 
Now we can define 
re=V(S)/(V(S) +1] [6] 


This correlation coefficient is stated in 
the usual form as the ratio of the pre- 
dictable variance to the total variance 
Of a set of scores, and as shown above, 
the total variance of the responses is 
vs ) + 1, and the predictable variance 
Of the responses is V (S). This corre- 
ation coefficient is quite similar to one 
Proposed by Gulliksen and Tukey 
(1958, p. 108), except that their co- 
cient is a reliability of mean scale 
Values, while the coefficient proposed 
ere is essentially a reliability of indi- 


vidual scores, and is a straightforward 
variance ratio. 

From the relation of Equation 6 
(l-r?) is 1/[V(S) +1], and from 
Equation 5, Equation 1 is directly de- 
rivable, 


Characteristics of the Estimate 


This estimating procedure is quite 
accurate for these data. It should be 
pointed out, however, that the accuracy 
of the estimating procedure depends on 
the normality of the distribution of dis- 
criminability scale values, since the esti- 
mating equations work accurately, as 
both Garner and McGill (1956) and 
Attneave (1959) have pointed out, only 
when both variances are approximately 
of the same distribution. This require- 
ment means that the normal distribution 
is necessary for the mean scale values 
since the error variance is already 
forced not only into a unit system but 
also into a normal system. If the stim- 
ulus scale values are off to the extent 
of providing a rectangular distribution, 
the error in estimating could be as 
much as 0.25 bit. The estimating pro- 
cedure can be used quite generally, 
however, if a correction factor is esti- 
mated for the shape of the distribution. 

It is interesting to note that. these 
estimates of information transmission 
agree very well with the information 
transmission measures corrected for 
bias, rather than with the uncorrected 
measures, even though no unbiasing 
procedure is inherent in the calcula- 
tions. The reason for this fact is that 
the discriminability scales are based on 
essentially all of the available data, and 
thus are not themselves subject to the 
same sorts of bias which operate with 
the directly computed measures of in- 
formation transmission. In this re- 
spect, the estimating procedure accom- 
plishes much the same effect in min- 
imizing the problem of bias as do 
the procedures suggested by Attneave 
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s n anion 
1955) for pooling data in variou 
ce each are appropriate to specific 
situations. 


Tue OPTIMUM NUMBER oF 
CATEGORIES 


The primary purpose of this paper 
is to demonstrate that information meas- 
ures and discriminability scales are use- 
ful in determining the optimum number 
of rating categories, and furthermore to 
show that these two measures are ra- 
tionally related. It is nevertheless ap- 
propriate to discuss the nature of the 
factors which affect the optimum num- 

ber of rating categories, 

The optimum number of rating cat- 
egories, or at least the number beyond 
which there will be no further improve- 
ment in discrimination, is clearly a 
function of the amount of discriminabil- 
ity inherent in the stimuli being rated. 
Thus there can be no single number of 
rating categories appropriate to all rat- 
ing situations. At the limits of dis- 
criminability, this fact is obvious. For 
example, if ratings are made of a set of 
objects between which there are no 
perceptible differences, then one rating 
category will do as well as two or more, 
If there is no perceptual discrimination, 
there is no need of a discriminating 
rating scale, 

On the other hand, suppose th 
have twenty different objects 
which are clearly discriminable, 
we must have as many rating cate 
as there are objects in order for raters 
to demonstrate their ability to discrim- 
inate, and ranking will be the most 
effective procedure. In information 

terms, we cannot allow the response 
information to limit the measurable in- 
formation transmission. . 

The limiting cases are manifest; the 
intermediate cases are much more am- 
biguous. Bendig and Hughes (1953) 
had their subjects rate 12 countries 
for familiarity, and when the Miller- 


at we 
all of 

Then 
gories 


Madow correction for bias is applied 
to their data, there is no improvement in 
information transmission beyond three 
rating categories, with which the infor- 
mation transmission obtained is approx- 
imately 0.25 bit. When Bendig (1954) 
had subjects rate preference for 20 
foods, increased information transmis- 
sion (again corrected for bias) was ob- 
tained up to at least nine categories, 
with which 0.32 bit of transmitted 
information was obtained. In the 
present experiment, increased discrim- 
ination is obtained with up to 20 cat- 
egories, with which approximately 0.50 
bit of transmitted information was 
measured. 

What seems to be clear from these 
few cases is that the number of useful 
categories increases with an increase in 
the amount of information transmis- 
sion, and that the number of categories 
needed is far greater than the infor- 
mation transmission figure seems to 
indicate, 

This fact that the number of useful 
rating categories is far greater than the 
information transmission seems to indi- 
cate is at first glance very surprising: 
Particularly since in the absolute judg- 
ment situation, which is so similar, quite 
another conclusion has been drawn- 
When information transmission has 
been measured in the absolute judgment 
context, the obtained measure has been 
used to determine the minimum number 
of categories with which the given “| 
formation transmission can be obtaine?, 
and many experiments have demon- 
strated the validity of this interpreta: 
tion of information measures. In ent 
words, it has been demonstrated 
there is a channel capacity, such tha 
no further information transmission 2 
be obtained if the number of ere 
is increased beyond that indicated by 
the channel ca; acity. 

The äbsolute udeni paradigm and 
the rating scale paradigm are in taci 
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however, quite different, and the es- 
sence of the difference is this: In the 
absolute judgment paradigm, the num- 
ber of stimuli and the number of re- 
Sponses are varied concomitantly, while 
by the nature of the problem they can- 
not be in the usual rating scale applica- 
tion. In the absolute judgment case, if 
we have an information transmission of 
2.0 bits, we can interpret this figure as 
indicating that we can obtain just as 
good discrimination if we use four 
stimuli and four responses as if we use 
any larger number. Furthermore, we 
can and should select the particular four 
stimuli in such a way that maximum 
discrimination will be realized. 

The nature of the rating scale prob- 
lem, however, is that we have a fixed 
number of objects or samples to rate, 
and we are not free to make this num- 
ber smaller. If 20 different foods, or 
50 men in a plant, are to be rated, we 
are not really interested in seeing how 
well we can transmit information if 
fewer foods or fewer men are rated: 
we want information about each and 
every stimulus to be rated. 

As long as we are not free to select 
a small representative set of stimuli 
(i.e., cannot recode into a more efficient 
Set), then we will need to have a large 
Number of rating categories to obtain 
relatively small amounts of information 
transmission. If two adjacent stimuli 
on a discriminability scale are reliably 
different, then the rating scale must be 
Sufficiently detailed to allow that differ- 
ence to be shown. R 

The nature of this problem is easily 
shown with reference to the hypothet- 
ical set of data shown in Table 2. For 
this table, the contingent uncertainty, 
Or transmitted information, is 1.07 bits. 
Now we can easily select two stimuli 
and two responses which will allow us 
to obtain 1.00 bit of contingent uncer- 
tainty—we simply take the first and 
fourth stimuli and assign one response 


TABLE 2 
AN ILLusTRATIVE CONTINGENCY TABLE 


Stimuli 
Responses 

1 2 3 4 = 
1 0.21 | 0.04 0.25 
2 0.04 |,0.17 | 0.04 0.25 
3 “0.04 |,0.17 | 0.04 | 0.25 
4 10.04 | 0.21 | 0.25 
= | 0.25 | 0.25 |°0.25 | 0.25 | 1.00 


Note.—Cell entries are proportions of joint 
occurrence, 


to each and the pattern of the data 
makes it clear that we can expect to 
obtain no confusions between these 
stimuli. Thus we will obtain 1.00 bit 
of transmitted information. 

Suppose, however, that we must keep 
all the stimuli, but want to reduce to 
two response categories. The best we 
can do is to group Response 1 with 2 
and also 3 with 4. But since the re- 
sponses to the second and third stimuli 
will overlap these two new response 
categories, there must remain some re- 
sponse equivocation, and we cannot ob- 
tain the maximum of 1.00 bit of infor- 
mation transmission. In this actual 
case, if the data from Table 2 are 
pooled to form the 4 x 2 matrix, the 
resultant information transmission 1s 
0.68 bit. 

Thus the essential difference between 
the absolute judgment situation and the 
rating scale situation is not in the char- 
acter of the stimuli, but rather in the fact 
that we feel free to select stimuli in one 
case and do not in the other. If loud- 
ness ratings were obtained under equiv- 
alent conditions, i.e., if we keep a fixed 
and large total number of stimuli, but 
change the number of rating categories, 
information transmission would vary 
with the number of rating categories 
Er small amount of available data 
suggest that the response information 
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Gie., logarithm to the Base r. ohe 
number of categories) should be a a 
5 to 10 times as large as the expecte 
information transmission, at least for 
small amounts of information transmis- 
sion. For small numbers of stimuli to 
be rated, it is almost certainly worth 
using’ complete ranking rather than 
coarser scales. And as a last statement, 
it is clear that information transmission 
cannot be lost by increasing the number 
of rating categories. Therefore, it is 
better to err on the side of having too 
many categories than to err by having 
too few. 
SUMMARY 


The purpose of this paper is to show 
how information transmission measures 
and discriminability scales can be used 
to determine the optimum number of 
rating categories. An illustrative set of 
data were obtained for 20 samples of 
handwriting with 4, 6, 8, 10, 12, 16, 
and 20 rating categories, Information 
transmission measures, after correction 
for sampling bias, show a small but 
definite increase in discrimination up to 
the 20 categories used, Discriminabil- 
ity scale values were also constructed 
from the rating data, and the standard 
deviations of the scale values show the 
same increase in discrimination. The 
variance of the discriminability scale 
values can be used to estimate informa- 
tion transmission, and the estimates 
agree very well with the directly cal- 
culated information measures, It is 
pointed out that the optimum number 
of rating categories must depend on the 


discriminability inherent in the partic- 
ular rated stimuli, and that the optimum 
number of categories will be large com- 


pared to the measured information 
transmission. 
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THE MULTIPLE ABSTRACT VARIANCE ANALYSIS 
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FOR NATURE-NURTURE RESEARCH ON CONTINUOUS 
VARIABLES 
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University of Illinois 


ESSENTIAL NATURE OF THE METHOD 


Research in psychological genetics 
operates either on: (@) particulate 
behavior syndromes, e.g., Hunting- 
ton’s chorea, phenylpyruvic amentia, 
which are visibly either present or 
absent, and which can usually be 
directly handled by Mendelian con- 
cepts and analyses applied to familial 
observations; or, (b) apparently con- 
tinuously distributed behavior traits, 
the variance on which one can attempt 
to assign to various environmental and 
genetic sources. Some inferences in 
terms of the former can to a slight 
degree be made from the latter type 
of biometrical and population genetics 
observations through the mathemati- 
cal treatments of Fisher (1930), 
Haldane (1932), Kempthorne (1957), 
Mather (1949), Wright (1931), and 
others. However, psychologists would 
in general be content, and, indeed, 
greatly heartened, even to be given 
some reliable information about the 
nature-nurture variance ratios for 
the more important continuous hu- 
man personality and ability dimen- 


1 The writer wishes to express his great 
indebtedness for discussion and advice to the 
following (who are nevertheless not to be held 
responsible for conclusions) : geneticists N. E. 
Morton and W. J. Schull; geneticists and 
statisticians H. W. Norton and R. W. Touch- 
berry of the University of Illinois; and statis- 
ticians D. L. Burkholder, R. A. Fisher, Q. 

cNemar, and, especially, C. R. Rao of 
Calcutta. He wishes further to thank his 
assistants Merlin Foster, John Horn, and 
Marcia Sedgwick for their persistence and care 
in the algebraic treatment of the equations. 


sions. For the overwhelming majority 
of psychological measures are, as far 
as we can see, continuous, and the 
instances that can be directly an- 
alyzed by particulate Mendelian anal- 
ysis are negligibly few. 

Until a few years ago all published 
instances of nature-nurture variance 
analysis ratios for psychological vari- 
ables were based on studies of identical 
and fraternal twins (sometimes sibs). 
At that time the present writer (1953) 
put forward a design called the 
Multiple Abstract Variance Analysis 
method, which seemed capable of 
answering more questions than the 
twin method and of avoiding certain 
artificialities and limitations therein. 
Although developed independently in 
the field of psychological measure- 
ments, it proved on examination 
to be much the same in general 
principle as the estimation of com- 
ponents of variance used in agricul- 
tural and biological genetics. How- 
ever, the MAVA method as we 
shall henceforth abbreviate it) con- 
tains features unique to the human 
cultural situation and to psycho- 
logical measurement. Moreover, It is 
new to psychologists, being exempli- 
fied in only two studies. The first, 
measuring personality factors by ques- 
tionnaire, occurred in 1955 (Cattell, 
Blewett, & Beloff), and the second, 
using objective, behavioral batteries 
for the personality factors, and a 
culture-fair measure of intelligence, 
was published in 1957 (Cattell, Kristy, 


& Stice). 
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esults of these first applica- 
Ps in that they demonstrated 
diverse and characteristic nature— 
nurture ratios for well known person- 
ality dimensions, agreeing, moreover, 
with ulterior evidence, were encour- 
aging. It seemed very desirable, 
therefore, to develop further the 
potentialities of the method itself2 
This paper sets out to make a closer 
examination of the method, its as- 
sumptions, its calculations, and its 
needs in terms of more refined de- 
velopment. 

The title of the method derives 
from the fact that conceptual vari- 
ances are introduced, i.e., variances 
which can only be inferred or “ab. 
stracted” from actual, observed vari- 
ances. The label designates this 
abstractness in order to avoid con- 
fusion with the usual use of the 
analysis of variance design in which 
questions are asked purely in regard 
to concrete variances, It is “‘multi- 
ple” because it views each concrete 
observed variance as the result of 
several, abstract, inferred variances, 
The MAVA method in fact postulates 
that the observed variance on a given 


trait, e.g., a personality factor score, 
measurable on a continuum, can be 
resolved into four contributory varj- 
ances as follows: 


? The writer has to apologize for five years 
of delay in presenting this fuller statement, 
occasioned by nothing more than writing 
difficulties in presenting a simultaneously ade- 
quate explanation of genetic concepts, for 
psychological readers, and of Psychologica] 
measurement assumptions, for geneticists, in 
anything less than book length. The delay is 
regrettable because psychological genetic 
researches have been started in the interim on 
what is essentially an obsolete twin design, 
or researchers have been com pelled laboriously 
to derive independently from the initial State- 
ment (1953) the formula now given in Tables 
l and 2. This situation may perhaps justify 
the present necessarily uncomfortably con- 
densed presentation. 
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Variance’ through causes resident in the 
differences of environment between fami- 
lies. Henceforth written Ob. 

Variance through causes resident in, the 
differences of heredity between families. 
Henceforth written Tbh. 

Variance through causes resident in the 
differences of environment for individuals 
within the family. Henceforth, oe. 

Variance through causes resident in the 
differences of heredity between siblings 
within the same family (gene segre- 
gation). Henceforth, osr. 


Furthermore, it does not assume 
that these sources are uncorrelated 
but attempts simultaneously to de- 
termine what correlations exist be- 
tween these hereditary and environ- 
mental variances, both within families 
(henceforth, Twewk), between families 
(henceforth, 7ve.ba), and, occasionally, 
through the rare correlation of a 
within and between variance. Ob- 
viously, the equations below imply 
that heredity and environment act 
mutually in a simple additive fashion, 
without any “catch all” category to 
accommodate to all kinds of inter- 
action effects, as usually conceived in 
analysis of variance. At the present 
stage of the model, one could either 
assume heredity and environment to 
be uncorrelated and then permit 
interaction, as in analysis of variance, 
or sacrifice interaction, The assump- 
tion of uncorrelated heredity and 
environment seemed to us, in the 
Psychological field, less tenable.? 

more ‘ideal model would be one 
which both allows covariance, as here, 
and also interaction, but, while such 
a model is being worked out, the 
Present model can be expected to give 
the usual approximate fit of a simpler 
model to a, possibly existent, more 

3 Were it not 
remarkable st; 
traits cannot 
and environn 
necessary 
variances 
the measu 


that one sometimes hears he 
atement that ““psychologica 
be wholly resolved into genetic 
nental causes” it might be un- 
to add here that we assume that E 
and covariances are exhaustive O 
red trait variances. 


| 
| 
| 
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complex natural pattern. Addition- 
ally, the assumption will be recog- 
nized that the values obtained for the 
variances and correlations are specifi- 
cally those existing in the given cul- 
tural and racial sample (referable, if 
desired, to parameters for a statistical 
Population of that type), and that 
generalization beyond this, e.g., to 
modern, primitive, etc. cultures, is 
Possible only in the light of additional, 
Strategic, experiments. Since a com- 
plete statement of assumptions in a 
model can be endless, the reader 
should accept those obviously im- 
plied, while others will be mentioned 
later as they become relevant. 

The general manner of building 
equations relating observed to ab- 
stract variances will be evident from 
inspecting, say, Equation 4 in Table 1 
below. Here it is stated that the 
observed variance within a series of 
pairs of siblings reared together, o%sr, 
is the sum of within family environ- 
mental variance, o%., plus within 
family heredity variance (due to differ- 
ent segregations of parental genes), 
wa, plus the covariance (if these 
Influences are correlated), 27oweown. 
The calculation of the observed vari- 
ances, as such, will be discussed later. 
Originally only six such equations 
Were written (Cattell, 1953), because 
1t was presumed that there would only 

e six unknowns—four variances and 
two correlation coefficients. Thus by 
the solution of six simultaneous equa- 
tions, using the six experimentally de- 
terminable concrete variances—o*sr, 
s4, etc.—it was hoped to solve for 
the six abstract, conceptual values. — 

However, six equations and six 
Observed variances are not enough, 

cause: (a) The last two or three of 
the six equations import other un- 
knowns. For example, one of our 
Necessary equations was for twins 
reared together and, unless one makes 
the psychologically unsatisfying as- 


sumption that the typical environ- 
mental difference in the lives of 
twins is the same as that between 
sibs, a new term, for a special kind 
of intersib environmental difference, 
"wer, Must be introduced. (b) Simi- 
larly it turns out that with additional 
experimental groups more than the 
above two heredity—environment co- 
variances are involved. (c) Linear 
dependencies systematically appear 
between the equations for “within,” 
“between,” and “total” variances in 
each of the family types one uses, 
thus reducing the number of effective 
equations. 

The additional equations that may 
be used, and the alternative assump- 
tions possible in using various com- 
binations will be discussed after the 
following section, clarifying the calcu- 
lation of concrete variances. 


CALCULATIONS AND ASSUMPTIONS 
CONCERNING CONCRETE, EX- 
PERIMENTALLY MEASURED 
VARIANCES 


Conceptually, the within family 
variance, on some such trait as in- 
telligence, surgency, etc., is the vari- 
ance (a) among the children, i.e., not 
including the parents; (b) of the 
typical American family (or whatever 
cultural or ethnic group one studies) ; 
(c) affected by size of family only 
statistically, i.e., to the extent that 
the variance estimate uses different 
degrees of freedom with different 
family sizes (actually we took mostly 
pairs but occasionally trios, from 
which to estimate this variance); (d) 
corrected for age differences, 1.€., 1t is 
the variance that would exist, on 
traits susceptible to age change, if all 
sibs were brought to the same age; 
d for experimental error of 


e) correcte 4 
a but zot for function 


measurement, 


fluctuation. : 
The justification for using the con- 
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cept thus is partly experimental 
practicability and partly the achieve- 
ment of maximum negotiability in 
further theoretical or calculation use 
of the obtained variances. Objection 
can be raised to (c) that large families 
might have, for example, smaller 
variance, on certain traits, due to 
more powerful group ‘‘coercion to the 
bio-social mean” (Cattell, 1957), 
This can be investigated by our 
design, through an experiment with a 
sample of small size families con- 
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allowing an age correction we are al 
ready here using an ‘abstracted’ 
rather than a literal, concrete vari- 
ance; but there is no point in cor 
founding psychological variance with 
age variance. If one knows the age 
variance of families in a given culture, 
the concrete variances can always be 
readily “restored” from the obtained 
variances through this more analytica 
approach and, meanwhile, the nature— 
nurture ratio is more pure, comparable, 


and comprehensible for an age-con- 
trasted with one of large size, Mean- stant group. 
while the adopted assumption is he issue in (e) is best seen by 
simpler. As to (d), it is true that by Setting out the equation: 
rok — ma a oy 
Pom rs ore Observed Error of Function 

ase ial Difference) Variance Measurement Fluctuation o] 
Although the allowance for error of that we are interested in, Function 
measurement, derived from the de- fluctuation variance is calculable from 
pendability coefficient of the test the difference of the dependability 
(Cattell, 1957) by de=] T ta is and stability coëficiente (Cattell, 
always only an estimate, it is better 1957), but for lack of the latter it w25 
than not allowing for it at all. The not actually tak t of the variance 
question whether function fluctuation in Our experi ate © ttell, 1957; 
should be taken out depends on one’s Cattell ieee a E ff ' 19337. 
conception of environment, Periodic hen not taker tit theoretically 
or aperiodic, short-interval, function swells the environme, i Hance jal 
fluctuation occurs in most Personality it fa significant rite à v istribution 
and ability measures (Cattell, 1957; of ratios for t at the aoa 
Fiske & Rice, 1955; Thouless, 1936) dimensions in WO En p act = 
pete or ee ae Causes, cidedly KWA P B O 
Shou one say at the er S : s ta 
hunger or sleep or the anxiety reset Predominance end. If experimen 


tion are not innate because they 
fluctuate hourly? The Writer is in- 
clined to the view that over a sufficient 
series of standard situations a reason. 
ably stable strength level Could be 
assigned to an individual on such 
fluctuant traits and that it is this 
level which we speak about When we 
talk of individual differences, Ce. in 
anxiety. For most purposes it is the 
nature-nurture contribution to this 
relatively stable “over a couple of 
months” value, at various age levels, 


‘ror is not taken out, it also ph a 
swells the variance Te 
ironment. Ideally, the" e 
fore, both should be taken out befor 
calculating the nature-nurture some 
e assumptions and problems g 
the estimation of the observed lly 
tween-family variance are essentia’; 


of the Same nature. Again, One ae 
‘stimating a value from the samp! to 
a statistical Population assume 

e 


Je. 
homogeneous with the samt) 
and, in this Case, the correctness © tio” 
above—that the statistical popula 
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is the typical American population— 
is harder to justify in the circum- 
stances commonly prevailing in na- 
ture-nurture studies. For, without 
quite extraordinary exertions by 
the experimenters, a disproportionate 
number of cases will come from social 
welfare agencies. Obviously, the ideal 
requirement is a k-way classification 
of families by religion, occupation, 
age of parents, economic, social, 
education, etc., background of parents, 
etc., corresponding to a stratified 
population sample (or, when investi- 
gating the effect of particular condi- 
tions, e.g., religion, on the ratios, a 
deliberately selected sample). 

Another specification which must 
be watched in defining and using the 
various variance ratios from the 
MAVA is whether the differences con- 
sidered are between boy, girl,‘ or 
boy-and-girl families. Even in the 
third case the equation for identical 
twins must, naturally, remain that 
for same sex pairs; but, provided 
oy has been used for the special 
between-twin environmental variance 
these same sex pairs can be used in 
the total solution. 

As to the degrees of freedom, the 
within-family values, when pairs of 
children are taken (deviation of each 
from their common mean, squared and 
added over & families), will be &. 
When z children instead of two are 
taken from each family it will be 
k(n — 1). For between-family vari- 
ance, obtained by subtracting the 
grand mean of all (sib) families from 
the mean of each family, squaring 
and adding, it will be k — 1. 


Concerts AND ASSUMPTIONS 
IN THE EQUATIONS 


It will be evident from the initial 
illustration by the case of within- 


1 Not only environmental, but also heredi- 
tary within family variances could be different 
°r girl-girl pairs, due to the extra chromosome. 


family environmental variance above 
that the experimenter in this area is 
placed in a theoretical dilemma pe- 
culiarly embarrassing to a perfection- 
ist. As he adds more equations in the 
hope of more completely determining 
unknowns, he adds new unknowns. 
In the writer’s opinion one can achieve 
a complete solution in the sense that 
the number of experimental situations 
capable of yielding new equations 
eventually overhauls the number of 
unknowns. But since it is not be- 
yond the wit of man to invent even 
finer conceptual subdivisions of family 
types and situations to which he 
would like a separate answer, let us 
admit that the process is asymptotic. 
Consequently, it is up to the experi- 
menter to draw a line where he will in 
data gathering, in the light of present 
estimations of magnitudes of various 
influences and the steps dictated by 
research strategy. 

Accordingly, the presentation is cut 
to two designs, one, called the Limited 
Resources Design, which makes grosser 
assumptions in order to accommodate 
to the resources of nine out of ten 
investigators, and the other, called 
the Complete Design, which aims at 
what, by ordinary standards, can be 
called a complete solution, regardless 
of difficulty or expense. The assump- 
tions in these two designs can best be 
discussed when we have considered 
the symbols used, as follows: 


Abstract Variances 


= trait variance (between sibs) due 
to within-family environmental 
(we) variation. Incidentally, we 
assume that this variability of 
environmentamong children raised 
in the same family is essentially 
the same for all kinds of families, 
including normal and adoptive 
families. On the other hand in 
the special case of fraternal twins, 
who are of identical age, and 
identical twins, who, according to 
clinicians, have an urge to be more 


1. oe 
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alike or (in other opinions); more 
individualistic, we enter with spe- 
cial terms capable of discovering 
such differences, as follows: 
2. ower = trait variance within family asso- 
= ciated with type of environmental 
difference peculiar to fraternal 
twins. po i 
3. awers = trait variance within family asso- 
i ciated with type of environmental 
difference peculiar to identical 
twins. 


4. own = trait variance due to within- 
family heredity (wh) variation. 

5. o%% = trait variance due to between- 
family environment (be) variation. 

6. o%y, = trait variance due to between- 


family hereditary (bh) variation. 


Another possibility—that the with- 
in-family environmental variance 
among sibs might be different in 
natural and adoptive families—has 


not been given separate representa- 
tion at this point. 


Correlations 


7. Tohwe = correlation between effects of 
within family heredity and within 
family environment (siblings). 

A systematically different value for 
this correlation might arise in special 
circumstances, notably (a) when the 
two sibs are of the same age, i.e., 
fraternal twins. For example, a trait 
which might be fostered in a younger 
by an older brother might be in- 
hibited in the interests of competition 
when they are the same age; (b) when 
it is an adoptive family, i.e., not true 
sibs and raised by foster parents, 
For, whether we believe with G, B. 
Shaw that parents are allergic to their 
own hereditary defects appearing in 
their children, or take a more kindly 

view that they will be more under- 
standing, the 7wews will at least be 
different. Moreover, the hereditary 
variance in adoptive (e.g, foster) 
families is typically greater than for 
ordinary families, being a sum (or 
other combination) of within and 
between hereditary variances. lf 


there are reasons for r settling down 
to a certain value with a certain 
variance, it is unlikely to remain at 
the same value when the range of one 
variable is altered. Actually we have 
assumed here that increased variance 
of heredity (Equation 6, Table 1) and 
of environment (Equation 5, Table 1) 
affect the ror .we term in the same essen- 
tial fashion and we have represented 
both by a single new 7 as follows: 


8. r’ohwe = correlation of heredity and en- 
vironment in adoptive families. 


In such families also, it will „be 
noted, we have the peculiar condition 
of a possible correlation between a 
between-group and a within-group 
variance, in that the foster parents 
may, for example, treat the children 
differently according to the hereditary 
variations appearing from the fami- 
lies from which they derive. Hence: 


9. reei = correlation of within-environ- 
ment and between-heredity pe- 
culiar to adoptive families. 
prime is placed on this ”, 
though there is no other Twerbh 
from which to distinguish it, 
because it helps classification to 
have all z's peculiar to adoptive 
families marked with a prime. 
correlation between effects ©! 
between-family heredity and be- 
tween-family environment. The 
frequently discussed correlation 
of “good” heredity with “good! 
environment among families is 4 
Possible instance. 


10. ronwe = 


There follow two possible correla- 
tions, probably nonzero only because 
of what may be called “placement 
agency” effects, In our society this 
arises partly from the deliberate 
Policy of welfare agencies of placing 
more able children in families likely 
to give them a better educational 
environment, but in most societies 
it might arise from a tendency tO 
adopt orphaned relatives. These place- 
Ment agency 7’s are marked by @ 
double prime, as follows: 


a 


EE ———— 
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11. rekte = placement agency correlation 
peculiar to sibs raised in differ- 
ent families, where within- 
family hereditary differences 
may become correlated with 
between-family environment by 
deliberate intention of social 
agencies in placing, e.g., brighter 
sib in more educated family. 

12. rinne = placement agency correlation, 
through any systematic tend- 
ency to match biological family 
characters of adopted children 
to environmental characters of 
foster parent families. 


Between four main variables—we, 
wh, be, bh—there are six possible 
correlations (more, with special vari- 
ables above) of which four would 
normally be dropped on the assump- 
tion that no correlation commonly 
exists of within and between vari- 
ances. However, having found two 
exceptions we should look for more. 
The possibility that roan and fwe.ve 
could be nonzero is however, hard to 
conceive. For example, it is difficult 
tosee any way in which an individual’s 
deviation in heredity from his family 
mean could be connected with the fact 
that his family differs in mean from 
the community hereditary mean. On 
the other hand, if, say, specific “‘roles”’ 
are more emphasized in highly than 
Poorly educated families, the possi- 
bility exists that an  individual’s 
deviation environmentally from his 
family mean will be greater when his 
family is environmentally most posi- 
tively deviant from the community 
mean. This would give a significant 
curvilinear correlation but no signifi- 
cant linear correlation. This will be 
recognized as another restriction on 
Our model, that it does not respond 
to curvilinear correlation among varı- 
ances. But it does mean that no 
especial reason exists for complicating 
Our equations with an vewe term. 
We are left, therefore, with l two 
main” (r) correlations, two com- 
Ponent” (r’) correlations, and two 


“placement” (r”’) correlations requir- 
ing terms for solutions. 

If space permitted, a number of 
other interesting theoretical possi- 
bilities could be discussed here, though 
most would be rejected as highly 
improbable. What may seem an 
undue refinement of conception, but is 
nevertheless vital in later interpreta- 
tions and applications, is implicit in 
the above labeling of within and 
between-family hereditary and en- 
vironmental variances as “effects” 
not “influences.” It would not be 
correct, for example, rigidly to reason 
that all environmental variance within 
families comes from influences within 
the family or all environmental vari- 
ance between families from influences 
within the community operating differ- 
ently on these families. Similarly the 
correlation of a within-family heredi- 
tary deviation with a within-family 
environmental effect does not imply 
that the correlation is brought about 
entirely by influences within the 
family. Tracing influences is a second 
step after locating effects. 

To determine the 12 unknowns 
above (6 variances, 6 7’s), which we 
have found it necessary to introduce 
on theoretical grounds in the Com- 
plete Design, one must employ, for 
reasons stated, more than 12 equa- 
tions. The following 16 experimen- 
tally determinable variances, each for 
a particular kind of group, are re- 
quired to represent the left side of 
each of the equations needed. (The 
equations themselves are set out in 


Tables 1 and 2). 


= variance in the trait as meas- 
ured among identical twins 
together by their 


1. @irr 


reared 

parents. i 

variance in the trait as meas- 

ured among identical twins 

reared apart. 

= variance in the trait as meas- 
ured among fraternal twins 


2. oiTa 


3. o°FTT 
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raised together by their true 
parents. 5 
in = variance in the trait as meas- 
4. osr ured among ordinary siblings 
reared together by their 
parents. " 
= variance in the trait as meas- 
ured among ordinary siblings 
reared apart. 

= variance in the trait as meas- 

ured among unrelated chil- 
dren reared together in one 
family with foster parents 
who are biological parents of 
none. 

= variance in the trait as meas- 

ured among unrelated reared 
apart (general population). 

8. o*pirrr = variance in the trait as meas- 
ured between the means of 
identical twin Pairs in their 
true families. 

9. array = variance in the trait as meas- 
ured between the means of 
identical twin Pairs raised 
apart, 

10. o*sxr = variance in the trait as meas- 

ured between the means of 

natural families, i.e., means 
of sib pairs (or trios, etc.) 
reared together, 

variance in the trait as meas- 

ured between the means of 

biological families whose sib- 
lings are reared in different 
foster homes, 
= variance in the trait as meas- 
ured between the means of 
social families, i.e., adoptive 
families of biologically un- 
related children, 

= variance in the trait as 
measured among half-siblings 
raised together, 

= variance of half- 
apart, 

15. o'snsrr’ = variance in the trait as meas- 
ured among means between 
half-siblings raised together 
by one true parent, 

16. o?3nsar = variance of means between 
half-siblings raised apart, 


5. o7sa 


6. eur 


7. ova 


11. otppp = 


12. otpsp 


13. onsr 


14. nsa siblings raised 


Other familial groups yielding ex- 
perimentally determinable variances 
are theoretically conceivable, but they 


5A further specialization can be made here 
according to whether child is raised with 
mother or father. 
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demand too much tact on the part of 
the experimenter in locating them, or 
too much moral aberration on the 
part of the community, to be prac- 
ticable! 


EQUATIONS FOR THE COMPLETE 
SOLUTION AND THE LIMITED 
Resources DESIGNS 


The 16 equations which link the 
above experimental, concrete vari- 
ances with the abstract variances and 
correlations seemed to produce them 
are shown in Table 1 below. We 
assume they will require no discus- 
sion, beyond the illustration already 
given of a typical equation (Equa- 
tion 4) for most readers. Perhaps a 
few words on a more esoteric example 
—Equation 13, half-sibs reared to- 
gether—will complete the explanation. 

Here we might have two children 
of one mother by two fathers. Inci- 
dentally, introduction of a different 
term for children of two mothers by 
one father is not considered necessary 
at this stage, nor is the variance con- 
sidered modified by the different heads 
of family in which the two half-sibs 
might be brought up together. They 
will differ by (a) within-family hered- 
ity variance, 6%, For each is only 
one ‘choice’ among the possible 
Parental gene segregations; (b) also, 
in this case, by half the interfamily 
hereditary variance, łom for one 
Parent is the same and one different; 
and (c) by the usual within-family 
environmental differences, owe. In 
terms of covariances there will be @ 
a “component variance” correlation 
of the between-family hereditary var a 
ance, (which is now part of the tota 
Within-family environmental vars 
ance) with environmental variance in 
the family. This produces the co- 
variance term, V2r'wo.0howeo on (res 
Coefficient v2 before this is aeva 
from 2/v3, the usual coefficien 
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TABLE 1 


THE SEVENTEEN GENERALLY PRACTICABLE EQUATIONS FOR THE 
COMPLETE MULTIPLE VARIANCE DESIGN 


Note.—The arrangement of equations in Table 1 can, for purposes of simplification be regarded as creating 
two blocks: 1. 1 through 12, which are constellations of ordinary sib and twin families, and which divide into 
within variances (1 through 6) and between variances (8 through 12), and, 2. constellations in half-sib families, again 
the first half being within and the second between variances. The matching of each between with its corresponding 
within is indicated by (Suppl. '"X")—Supplementary to the earlier numbered equation—written beside the matched 
equation, 


err =g xe (1) 
o TA = O79 F ob. [2] 
tere = own + otee + Lohor (or replace last term by 2rwh-werTwhTue!) [3] 
otsr = oh + o%ve 2 wh-reeTwhF we [4] 
asa = oton F ote + abe Hr wheweFwrPwel tb 21" 0h- bewe] [5] 
our = on + owe + osn 2 wh. weTwhOwe 2K we-bKTweTbA [6] 
uA = oh + oe + on F Orbe F ZrohsweTwhFue + Loh beTbhObe (7] 
(Suppl. 1) 
otpirre = 2o°wh + owe + 207, + 207% + 2V2rwheweTwhowe + AT DA-beTbAT LE [8] 
(Suppl. 2) 
oprrar = 2o%wn + atue + 20%, + ore + VIP wheweTwhwe + ZVIT vh beT Arde 

H 2V2r'we-bnower bab 2VIr wh beT weno be] BJ 
[o2serr = oten + o%ver + 2o% + 20%. F Pehme Genrer F ATININ] [9a] 
(Suppl. 4 or 3 and 4) 
pyr = orn F o%xe + 2o%n + 2o%e + Luhde we F Ebh- beTbhT be C10] 
(Suppl. 5) 
otnpp = oth F ove + 2o%n + o% H WrrohervceTwhFie H ZVZ’ we- bhw bh 

QI" hbe bho bel 2r" wh-beFwnT be] E] 
(Suppl. 6) 
o?psp = our + atea F aon + 20% + 2r'whceTwntwe F 2r'webhO weoh 

+ 2VŽr" oh beT thel + 2VÄr” oh beT whO be] [12] 
ousr = ten + owe + hoor + 2rwh-werwhowe + VIP" we bhO weT bh [13] 
dusa = on + owe + 40% + o*i + Wroh-weFwhFwe FH VIT or beT KT be [14] 
(Suppl. 13) i 
Sangre = dur Fom F iora + 20% + WrohetoeFwhFwe + VST oh: beT bh be [1s] 
(Suppl. 14) 
Ongar = ek Fohe F imat ou T WrohereF whFwe + VÓT Oh: beT v7 be [16] 


are similarly derived and from con- ` 


the root of 


2 being multiplied by 
present only 


a half because a°} 1S 
as a half variance.) (e) The usual 
covariance of heredity and environ- 
Ment, 2rrwe.whTweF wh: Other equations 


sideration of each it will be evident 
that a good deal of careful reasoning 
from first principles, and discussion 
among experts, has gone into them. 
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TABLE 2 
Tue TEN EQUATIONS USED IN THE LIMITED RESOURCES DESIGN 


C1] 
Er = Owe 2 
otprr = Poh F Owe F 2rwh-weT whore! 
disa = oh F Owe + 2rwheweT wh we [3] 
osa = wh + Twe F obe F Rron whee [4] 
eur = wh F Owe + 07h + rohim whee + 27 we-bhTweTbh [5] 
oya = wh + ore + osn + o% + 2roheweF wh we + 2ron.veT ono be [6] 
@pirre = 20%wh + oue + 20% + Lo% H 2VIr oh mT ohus + 4ron.noonore Ea 
onne = aeh + ove + 20% + 20t + rohont oe + Aron. deo ono be [8] 
BBE = wh + axe + 20%, + o?r + Pahau + 2VEr' ce.bnoueo on A 2VEr"on-seounode [9] 
oBsF = Oh + owe + otan + 20, 


F 2rohrceOwhowe + 2r'we.bhTweroh + 2V2r"'onbeoonose [10] 


To get the systematic organization 
of terms needed in such a presentation 
we have followed the rule of putting 
heredity terms before environmental 
ones, variance before covariance, with- 
in-family before between-family. The 
primes and double primes on 7's are 
explained above. Occasionally, where 
a mathematically Possible combina- 
tory term might also seem needed by 
the reader, but has been dropped by 
us on fuller consideration, it has been 
placed in brackets, i.e., Strictly we 
regard the term as unnecessary. We 
have deliberately refrained from work- 
ing out as decimals the various 
numerical coefficients, since their rep- 
resentation as radicals enables the 
reader more easily to perceive their 
derivation. 

The equations of Table 1 are pre- 
sented as a totality of experimentally 
available equations, from which an 
experimenter may draw according to 
the research resources available to 
him and the degree of simplification 
of assumptions which he cares _to 
make. Even the Complete Design 
cannot simultaneously use all of them, 
because of the linear dependencies 
noted above, though this does not 


avoid collecting the data that would 
be required to use virtually all of 
them. The 11 equations which we 
advocate as most effective for the 
final determinations of all 11 un- 
knowns in 16 are: t 2585.5). 6) Fi) 8158s 
10, 11, and 14. The only regrettable 
waste in this choice is that it does not 
use sibs raised together, Equation 4, 
an easily obtainable group, but the 
alternatives are worse. For example, 
half-sibs raised together are needed in 
most others, and these are socially 


difficult to identify, with scientific 
reliability. 
The costly features of the Complete 


Design are identical twins reared 
apart and half-sibs reared apart, both 
of which require much effort to locate. 
Accordingly we have suggested for 
most researchers the Limited Re- 
sources Design in Table 2, which uses 
only 10 equations and requires only 
sibs, and, in twins, those reared to- 
gether. A sufficient sample of these 
should be locatable by any investi- 
gator near a large city. It is not 


s are 
‘Commonly, one has the choice of eee 
either the within or the between varian 


from data gathered on a particular type of 
familial group. 


$ 


a 
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possible to create this design simply 
by lifting 10 equations from Table 1; 
for the assumptions that are made in 
each have to be changed in order to be 
able to reduce the total number of 
equations. The simpler assumptions, 
(a) consider the environmental vari- 
ance for identical and fraternal twins 
raised together to be identical, as 07 wer ; 
(b) omit differentiation of 7'wh.we and 
Twh.we} i.e., the modification of the en- 
vironment-heredity correlation when 
the hereditary and the environmental 
variance are increased in families, re- 
spectively, of unrelated children raised 
together and sibs raised apart. Thus 
it solves for five variances and three 
correlations, instead of for six vari- 
ances and five correlations as in the 
Complete Design. 


THE SOLUTIONS AND THEIR 
CONFIDENCE LIMITS 


Since the solutions of the above 
equations are a considerable under- 
taking—mainly because of the need 
to choose the best sequences in com- 
bining them—it would be helpful 
to have tables set out for the general 
solutions. The results of some three 
or four months of algebraic work and 
checking are set out in Tables 3and 4, 
in which the researcher can simply 
insert his experimentally-obtained, 
concrete variances to obtain the 
abstract variances. In order not to 
interrupt the main presentation these 
have been placed at the end of the 
article. Quite apart from the sub- 
Stantial algebraic task there are cer- 
tain theoretical problems in solving 
the equations of Table 1, which have 
been discussed in another context by 
Nelder (1953). Some of these are 
aPproached below. f . 

Additionally to the solutions in 
lables 3 and 4, which are only as re- 
liable as human pursuit of long ma- 
nipulations of simultaneous equations 


can be, and should accordingly be 
checked before use, solutions have 
been worked out along different lines 
on the electronic computer. A ma- 
trix, A, consisting of the coefficients 
for variances and covariances arranged 
with rows corresponding to the 11 
equations from the Complete Design, 
is post-multiplied by a Vector X of 
abstract variances and covariances to 
yield the vector of concrete variances, 
B, thus 
AX = B 


Solving for X would give the ab- 
stract variances, thus, 


X = AB 


But, since we wish to compare the 
coefficients for the concrete variances 
in these solutions with those in Tables 
3 and 4, which have to be utilized 
in the following discussion on stand- 
ard errors and confidence limits, we 
stopped the solution short at obtain- 
ing A~. This was done by the fol- 
lowing transformation because the 
Limited Resources Design, unlike the 
Complete Design, does not present a 
square matrix of coefficients 


A= = (4'A)14’ 


These matrices of coefficients may be 
obtained by application to the author. 
The principal incompleteness of the 
MAVA method as first applied (Cat- 
tell, Blewett, & Beloff, 1955) to 
psychological data was the lack of 
standard errors and confidence limits 
for the values obtained. As Tables 3 
and 4 remind one, each of the abstract 
variances finally obtained is a func- 
tion of quite a number of experi- 
mentally-determined variances, while 
the correlation coefficients have at 
least an equally complex derivation. 
At the present juncture in research 
it becomes important to attempt to 
solve the complex problem here pre- 
sented. An additional complexity 
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TABLE 3 


SOLUTIONS FOR SIX ABSTRACT VARIANCES AND Fiy HEREDITY-ENVIRONMENT 
CORRELATIONS, IN THE COMPLETE DESIGN 


F z i = ji i ion, a tin 

= venience, certain equations below, namely, those having a quadratic solution, are set ou r 

a PT aoe nee the roots which give possible positive and naait entries, while the follow- 

Dee dR ie give the actual resulting alternative answers—and in decimals einen there is no longer 

need to reveal the mode of derivation. In these cases both solutions are retained until further considerations cause 
one to be ruled out. 


BITAF —.5V20* BBP +o2)p4 +(.5V2—lo* aye ee | 
Gris a a Aaea 205ihge [la] 


Owe =2o*v4+.70700% Bere —o*s4— -10700* gira +. 10700? zpp+o2)74 — 1.70700? ye [1b] 


we =20?vaA—.5SV20* rT R—o734+.5VIq2 


Pwe = 21 eherceF*wh — (oher — © PTT) — T ohw VAP 


whea or — Aoter Fhoto [2] 
Ove =2 pT [3] 
Fun = (.5+.5V2) 62x ppp — 5V20%s4+.50%e rap — Sonar —(.5 +-5V2)0? ave +.SVIo% 774 [4] 
own =1.20700 a irre — -107005 4 +50? aipap— So*pnp—1 20700" ayr-+.707002;7.4 [4a] 
oy, = ~ 20700" a irre +.7070054 +50 pirap— So* app +.2070c? py p— 70700274 [4b] 
re =.5VIotgimrp +o’sa—2o?y4— SVZotBirar+.5vVZop ap + (1—.5V2)o2 aye [5] 
oy =, 70700?Birrp +o%54— 2o?y4— :10700? pip ap +. 70700? ppp, 2930c aye [5a] 
oy =; 70700%siTrr +0254 — 2e?ya +. 10700? gir ap~, 707007 gp-+-1 107062 ny p [5b] 
CNA =0°Bi rrr +VIo?s4 +(2+V2)c2y4 ~Parar+oppp—otgyp— (2 V2+2)o2ns 4 [6] 
on =e prep 1.4 141a2s4 +3.4141o2y 4 —arrar+o2y pp =o gy 4.828207 54 [6a] 
tn =0°BiTrp—1 Al41o%s4+.585952, 4 —oBirar +o npr ~ Pane +.82820275 4 [6b] 


Toh-we = 250? aye — (.25V2-+.5)o2in4—,2592 
(S+.5V2)¢?nrpre—.5Vio2g4 Sot 


BITTR-+ (-25V2+.5)o2¢4 
BITAF—.So*ppp— (5 +.5V2)0 pyp+.5Vl02/7A 
LSW 25)ot orn ar+ (25VF-4.25)atgyy [7] 
BITTR —a*s 4 +.5V202 ppp — SV20* pap +o a+ (.5v2— 1)o? ane 
ranae = LBIES otsa (25VE-4. $92 
V(5 +.5V2) 0% aiprp—.5 Vle? 


X V2otya—.5 Vo 


Bsr + (.25V2-+.,25)92 
sa+.So°Bi Tap — So? 


BNF— (.25V2+.25)o2p perp 


Bar (S+.5V2)o2pype.SViotra 
= 2507 arrart.250%p gp 


[8] 
SAt-SVlo2 arp ap — SVlo*p ap +o 74+ (.5v2— l)o*ayr 
(2+1ensa+(1 =-25V2)q2 


BUP=5V20%54~ (24.5 V2)oty 4 
V2o%va—.5V20% arene osa ATTESA +(.5V2—Do®avr 
+(.5—.25V2)o? Birar + (25v2—.5)q2 


irre + (.25V2 —,5)q2 
X Normer tAr na t Aoa ea paige 


X V2o?ya—.5 Vio" grrr =o? 


Pwen = 


BBR +.5o?yp [9J 
F— (2V2+2)o%754 


* For the sake of simplification it is best to determine y; ¥ in this 

equation to determine geet o čdireciiv t me values for rense and ofen and substitute these in t 
olutions 12 throug’ are not directly for any Tequired They 

are introduced as conveniences in the process of solution, ney con e there are only 11 pa uukagwas i were 

treated as single unknowns in the solution of the original equations. The be on containi: Aa can be directly 

obtained for these solutions by dividing the solution obtained by the Correlations rih.ce etC., 

Variances indicated on the left-hand side of the equation, 


C. f the 

sey Product of the corresponding squares o a 
i «Or, if it is desi i rs be use 
and by direct substitution, the values for rwh.we, etc., can be obtained. orga the four Saviors (7 pi may solution 
is given to save time and space. (If the plus value of 2 is used in obtainin, thes iances (Equations 1-6) con- 
sistency must be followed by using plus values throughout, g varia: 


| 


-l 
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TABLE 3—Continued 


_ (.S+.25v2)o%errar— (5 +.25V2)o*Birre— (5 +.5V2)otva— (S+.25V2)o* BBF 
= V.S Birre -+o2sa—202y4—.5V207 BIT AF +.5Vl0°B BF + (1—.5V2)o°BNF 
— (.5+.5V2)07s4+(V2+1)o2nsat (.5+.25V2)o? BNP [10} 
X Vo? grrr +Vl0%sat (2+-V2)0?va—o8arrar+o2aar—oanr— (2V2+2)o%nsa 
_onsat (.75V2 —.75)o2ave+(.25V2—1)o%sa+ (1 —.5V2)02uat (.75V2 —.25)o*Birar 
i V.5V2o?sirrr+o%sa— 20° 4—.5V20° Tr ar+-5V20°p ar + (1—.5V2) 0° BNF 
+(.25 —.75v2)o* errr + (.25 —.5V2)o*aar—.So*ur [11] 
X Votnirrr+V2otsa + (2+V2)o°va—o*sirar +0°BBr —o°Byr — (2V2+2)o%ns4 


FohweTwhTue =.250° Byr — (-25VZ+.5)o rra —.250°Birrr + (.25V2+.5)o%s4 
— (.25VZ+.25)o?Brrar + (.25V3+.25)0sse [12] 


Tohebe 


” 
T bhebe 


TohweTwhOwe =-2502nve — 8535077 —-2502 nirre +.85350°s4 —.85350°prrar +.85350°nBr [12a] 
ToheweTwhTwe = 2502 Byr —-14650% 74 —.250° Birre +.14650°s4 —.14650°srrar +.14650°BBr [12b] 


T'whweTwhTwe = (.25V2 +.5)o7s.4— (.25V2+.5)o%asr+ (.25V2-+.25)o?BNr 
— (.25VZ+.25)o?Bırrr—.250°pırar +.250°sBr [13] 


1 wheweTwhO we = 8535025 a —.85350t Bsr 4.60350? BNr —.60350°Birrr —.250° Birar +.250°BBr [13a] 
1 whoweTwhTwe = 1465029 4 —.14650°Bsr —.10350° BNr +.10350°g1rrr —.250° Brar +.250°BBr [13b] 


T'webhoweron =(VIH1)onsa+(1—.25VI)ot syr —.5 Vosa — (2+.5V2)o°ua i 
+ (5 —.25V2)e2arrar+ (.25\2—.S)otarrre +(25V2—.5)o*ssr+.50*ur [14] 


TeebhOetoh =2.4141o0nsa +.65650 BNr — 70700284 — 2.707007 va +.14650° Bir Av 


—.14650°sırrr —-14650°8Br +.50°ur [14a] 
Y webhOweoth = —-4141o%7s4+1.35350% ave +.70700°s4— 1.29300? v a +.85350°BI TAF 
—.85350°Bırrr —.85350°BBr +.50°ur [14b] 


Tonvecontbe = (5 +.25V2)o? Birar — (.5-+.25v2)o* BITTE — (.5+.5v2)o7va 
— (.5+.25V2)o’BBr — (5+.5VDosa+(VZ+1)o°nsa+ (5 +.25V2)o% ane [15] 


=.853502nrrar—.853502arrrr — 1.20700%v a —-85350* nBr — 1.207007 
+2.414lo%vsa+.8535eavr [15a] 


14650? g Br +.20700°s4 
—.4141o?nsa+.46500°gnr [15b] 


Toh beT bhO be 
TOA beT HAT be = 4.146502 a rar—-14650°n rrr +.20700° va — 
2sa-+(.75V2 —.75)o%ane + (.25V2 — lotsa + (1—.5V2)o*va 


n até 
“4 oy (TSv2 —.28)e¢arrar+ (.25—.75VD)o%arre + (.25—.5V2)o¢aar — Sour [16] 


: = 310602 pyr —-64650%s.a+.29300%y a +.81060°prrar 
Y snpeoonebe =0° NSA +H-31060°BNF Dua t S10 AEE n 


—1.81060°8NF — 1.35350?sa +1.70700?y a —1.31060°B1TAF 


T" bhi beT bhO be =0° HSA 41.310603prmar +-95700°Bzr —-50*ur [16b] 


lies in the fact that most investigators ous ratios between these, or between 


(including the present writer), have 
usually not been content with the 
values for the four main variances 
but have proceeded to calculate vari- 


certain sums of these, e.g., the ratio 
of total environmental to total heredi- 
tary variance. For these final ratios 
are more likely to express the ultimate 
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TABLE 4 . 


SOLUTIONS FOR THE FIVE VARIANCES AND THREE CORRELATIONS IN THE 
LIMITED Resources DESIGN 


Ore =(2—V2)osr+0°sa —o°ur+ (2V2 —2)o*u4—o°nrrrr+(2— V2)o’nyr+0°BBr—o?gsr [1] 


hes =-58590%er Hosa —o*ur +.82820°u 4—0? Birre 4.585902 5yp-+0tppr—očnsr i [la] 
us MBALAl or gr holga our 4 818 ofp ao aE etn pei E R [ib] 


Ore =0 ITT 


[2] 
Ai =o*sr(VZ— 1) +H.5VZos4—.5Vlotur+ (2—2 V2)oty 4 +0"Birrr 
+02 —2)o° ane +.5V20" Bnr —.5Vlo2gsp [3] 
dta eet H lON0etsa— 70708 yr B282 ahata Sisina 
-+.70700* nn —.707002 psp [3a] 
ater = —2Ald1ets7—.70700%s4+.1070—*vr-+4.828208y 5 tenner 
3414 1¢%pyr—.70700%par-+.7070%% nop [3b] 
Pwe =07¢4—2o2y4+o2pye [4] 
on =(1 +.5v2)02s4+(1 +.5V2)0% pn —2e?y ato? pyp — (L+.5V2)o?pse—.5Vo2 yp [5] 
aon SOTO sa +1-010otaar —2etvaetove~1.7010etase—.107dst, [5a] 
oy), ~2990'sat 29300ts ar —2etva-toteve~ 2930s" pse-+.1070etng [sb] 
a Serre — 502 pp — Se girre— :25V20%54— :25V20" ppp 
VEAC- DE SVa Seton O Ne ee 
(Square root sign continued) 
+:25V20" Bsr + (1 —-5V2) 6? aye 4 (V2 — 1)oya 
HOZ- 2)otaxe +5 Vinar —5Vio gan 
(S —.5V2)otsr4(.25v2)o? T 
XV(2=Whotsp-totsa et - 7 ee 
a ost +o*sa—atyp + (2V3 Devs ~#arrre+(2—V2)olnvrbe®nap ao ner 
pet 250° ur — 25025 4 
V2- VDotsrtotsa— otor FOU a 


va—o°BiTrr4(2— V2)o% 

—-250" pae+.250%n gp 

xva +S otsa t (1+5 Votar Zoto F etg (1+.5V2)c2pgp— SVr n 
.25v20"u7— (1 +.25V2)o?s441.50? 


Votsa—201 Fonn 


byr-+o*pnr—otnsr 


Toh be = 


vA~.So°gyp 

F 
=(.25V2-4.5)q2 

XVUF.S Vosa +0 +.5V2)02 ggr 


VZo?y 4 —.5V20°s4—.5VI07 pyp 


BB+ (25,2 +.5)o?nsr 
—20uA +o? pyp — (1+.5v2)o2 


[8] 
Bsr—.5Vlo° ur 
Oh be = 


Vo®sa—20°vato*ane 


a These variances (Subequations a and b) are ivi i = igi 
throughout, and wherein V2 assames a valis D LHAT in qp ONS wherein VZ assumes a value of 14141 in 1 
» Solutions 10 through 14 are for phrases containing radic: 
single unknowns in the solution of the original equations, T 
these solutions by dividing the solution obtained b; 


dicals. These ph in Table 1, aS 
sg UH Dhrases were treated, as in Table 

y the prego elAtions fehue ete., can be directly obtained fron 

indicated on the left-hand side of the equation, "Or, ic waneeuuct © four asiu ponding squares of the variance 

substitution, the values for fiche, etc., can be obtained. In Equations 6 thier: ie ray Bes solution is given 

to save time and space, (If the plus values of 2 are used i obtaining the variances (Equations 1-5) consistency 

must be followed by using plus values throughout.) a qı 


Ow a a 


— 


MULTIPLE ABSTRACT VARIANCE ANALYSIS EQUATIONS AND SOLUTIONS 367 


TABLE 4—Continued 


+.25v2e*asr—.25Vle%ur [9] 
X V(L+.5v2)o%sat (1+.5V2)0° par — 20° a-+o°axr— (1+.5V2)o2asr—.5Vle2ur 


TwhweTwhF we! = So*prr —.So* 77 —.5o*p 7 TF —-25V20%s 4 —.25V20°ppr+.25VIe%BsP 

+(1—.5v2)o? aye (v2 —1)e*vat (.5—.5V2)o*sr+.25Vlo*ur C10]? 
TwheweO wh ve! = Seer —.So* rr —.5e° grrr —.353507s.4 —.35350?pr Br +.35350°8SF 

+.29300?pvr+.414lo%y a —.20700%s7+.3535e2ur [10a] 
Fohe whee =. SoPTT— So 77 —.So* grrr +.353507s4+.35350% pp —.35350° RSF 

+-1.70700? avr —2.414 1027 4-+1.20700%s7 —.353502ur [10b] 
TwhweFwhFwe = —.So%s7 + (.S+.25V2) 0% ur — (.5+.25V2)0%s54— (.5 +.25V2)o°s Br 

+(65+.25v2)o*asr—.So*ayrtotva [11] 
TwhweTwhFue = —.So*sr+-.853S07 ur —.85350°s 4 —.85350° x Br +.85350°psr —.5o°Byr +0°ua [lla] 
TwhueTwhOue = —.So*sr-+.14650°% vr —.146502s4—.14650° x Br +.14650°psr —.50?BNr +0°va [11b] 
1M werhwetbh =.250°yr — .250°sa—.250%par+.250°%RsP [12] 
VONTA =.25VZo% yr — (1+.25V2)o%sa+1.50°%va—.So2ayr 

- (.25VI+.5)o?BBr + (.25VI+.5)o? Bsr [13] 

n Tomberoiobe =.35350%v7 — 1.35350%s.4-+1.502v 4 —.50%avr —.85350°pBr +-853507 BSF [13a] 

TomteTonote = —.35350°ur—.64650°s4+1.50°ya—.50?pyr —.14650°BBr +.14650°8sr [13b] 
1 Oh- beT bh be = ia —.5VIe?sa—.5Vlo? gyr +.25VJo? psr —.25Vlo*ur [14] 
1 Oh beTbhTbe =1 l 1410°v a —.70700?s4—.70700°Byr +.35350°Bsr —.35350°uT [14a] 
nse bho = — 1.414 162y.4+.707002s 4-+.70700% ane —.35350°psr +.35350%ur [14b] 


inferences in which one is interested, 
as a starting point for further theoreti- 
cal psychological developments and 
experiments. Yet determining the 
distribution and the confidence limits 
of ratios presents an additional chal- 
lenge to statistical inference. 

The typical form of any solution 
for an abstract variance, asin Tables 3 
and 4 above, may be written : 


6%, = kyo, + kao?s +++ + no’, [2] 


where @% is the estimated abstract 
variance, e.g., within family environ- 
Ment; kı kz, etc. are numerical, 
Constants; and o°, etc., are the esti- 
mated concrete variances for particu- 
lar family relation population species. 

ur need is to determine the variance 
Of the variance on the left, and to do so 
It is necessary to decide whether the 


variances on the right are correlated. 
Usually within and between mean 
squares are obviously uncorrelated for 
any one particular analysis. In this 
case, since they are quite different 
groups of subjects (Ss), it might be 
argued that the chances of systematic 
correlation are still less. However, 
the question is actually a very com- 
plex one, to be answered only by 
study of the structure of the data in 
each case. For example, in some 
instances of sibling pairs, they are not 
random subgroups from the larger 
group. The concensus of expert sta- 
tistical opinion consulted nevertheless 
is that, in terms of possible correlation 
over a series of samples, we are justi- 
fied in treating these as uncorrelated 
variances. Accordingly the variance 
of the abstract variance estimate is a 
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simple sum of the independent func- 
tions of the experimentally obtained 
variance estimates on the right of our 
equations, i.e., 


[3] 


where iis any subgroup. 

As stated above, it is likely in most 
actual experiments, unless one is pre- 
pared to throw away hard won data, 
that the numbers in the various ex- 
perimentally obtained family constel- 
lation subgroups (sibs, twins, etc.) 
will differ. If the successive groups in 

Equation 1 have 7, 22, etc. cases the 
above estimate of the variance of the 
variance is stated more completely as: 


oo, kta! 

cacat jme m—1 
2k? Zkrat 
a e aea S| 


However, if 2 is small—less than 30 
cases (unlikely in these experiments) 
—the degrees of freedom which give 
an unbiased estimate become x; + t 
etc., according to Daniels (1939) and 
Welch (1956) 

Thus the confidence limits for 
given abstract variance 


a 
can be desig- 


nated as: 
r PRITE T OR 
Pr | c2 — Z; hl Lé <o,2 
V2 ko 2 
+Z; n: —1 J-e, [5] 


where Z; is the normal standard de- 
viate for a given probability, P,; o 
the “true” value for the Population 
variance, and we assume that, through 
large n or the use of an unbiased esți- 
mate, the distribution of the variance 
is normal, 

Finally, to determine the confidence 
limits of heredity-environment vari- 


3 in 
P:P: = .81, whereas assuming lack of i 
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g 
Ton x N 
3 Or, in general 


ance ratio’s, e.g., 
Tbo 


2A we simplify the probability state- 
oy 


ments for numerator and denominator 
as follows: 


Pr[A <ê < B] =P, [6a] 
PLC <é? <D] =P, [6b] 


The combined probability for the 
ratio is not strictly the product P:P2 
because P, and P, are not entirely 
independent; but when using the ex- 
treme confidence interval ranges (.95 
to .99) in which we are alone interested, 
the error from assumption of inde- 


pendence is very small? and we 
May use: 


C 


As with any confidence limits test, 
it is not possible to decide, without 
actual empirical variance data carry- 
ing experimental error, what the de- 
sirable number of cases should be in @ 
well planned experiment. However, 
with such a major experimental 
undertaking as is involved in a really 
well designed MAVA experiment, uy 
determine nature-nurture ratios fo! 
some particular primary personality 
Source trait, obtaining the earlies 
Possible indication from experience 
as to what is adequate is vital. Eor 
tunately, a lead can now be obtain 
by applying the above formulae to t T 
sample sizes achieved in the a 
investigations of inheritance of Pi 
mary personality factors by Cattelh 
Blewett, and Beloff (1955) and Catte Z 
Kristy, and Stice (1957). me 
studies measured a total of Ae 
eleven- and twelve-year old childr 


ual 
7 For example with both P, and Pz ed 


t 
ith b e ge 
to .90 and assuming independence W de- 


a <2 <ž] = PPa K] 
B `g 


pendence this becomes .80. 
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on all personality factors established 
up to that time, using, first, question- 
naire measures (the High School 
Personality Questionnaire) and, sec- 
ondly, objective tests (the Children’s 
Objective-Analytic Battery). Taking 
for illustration first a factor from the 
questionaire, and one with the highest 
of hereditary determinations (next to 
intelligence), we choose H factor, 
parmia vs. threctia, a temperament 
factor believed associated with para- 
sympathetic system predominance. 


Two" : = 
utat za = 32 
The value —, came o L39 $ 


Twn 
making heredity about twice as im- 
portant, within the family as en- 
vironment. According to our present 
formulae above the P = .05 limits 
are approximately .18 and 1.39. 
Taking as different a case as possi- 
ble for our second illustration we turn 
to a predominantly environmentally 
determined factor and one measured 
in objective tests. Let us examine the 
between family variance. For this 
we chose UI 23 (Cattell, 1957) which 
Eysenck has called the general neu- 
roticism factor and which we call 


ergic regression. The value of —; 


came out at 6.7, and the P = .05 
limits by the above formulae stand 
at 1.9 and 32.0. All values make 
environment more important than 
heredity, but the range in this case 
1s very large. 

It is well known that as one extends 
the number of linear combinations 
from which estimations are made the 
Magnitude of the standard error in- 
creases in the disheartening fashion 
Seen in the above results. However, 
the conclusion to be drawn, since 
knowledge of the relative environ- 
mental and hereditary contributions 
on continuous personality and ability 
dimensions is of great theoretical and 
Practical importance (and since the 
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twin method has similarly large sam- 
pling errors and is additionally, fun- 
damentally amiss in its assumptions !) 
is surely that research must foot the 
bill. To say that the method is self- 
defeating because it requires 10,000 
rather than 1,000 cases is to use an 
argument which, applied to the an- 
alogous wearisome organization re- 
quired in separating radium, would 
have brought the work of the Curies 
to a standstill, or, in our own genera- 
tion, would have blocked the way to 
atomic energy. Those who believe 
that advance in the human sciences 
is as important as in the physical 
sciences will not blink at the expendi- 
ture which, though large compared 
with a rat experiment, is still a 
hundredth of that in several current 
physical science problems. At present 
(unless one pursues the mirage of 
analogy from animal experimenta- 
tion) the MAVA method, despite its 
experimental-organizational arduous- 
ness, is the only way either to check 
basic theories or, what is more im- 
portant, generate more exact and 
potent hypotheses, as discussed be- 
low. Nevertheless, it is presented 
here as unfinished in development, 
and requiring improvements, notably 
in refinement of the estimation of 
standard errors, in the use of the 
attenuation correction, and in regard 
to the assumption of noncorrelation 
in Equation 7, etc. These issues 
would require separate papers in 
themselves. s 

While no space can be given here 
to heading off beforehand the numer- 
ous misunderstandings possible on so 
complex a topic, we would point out 
that perusal of the discussion of as- 
sumptions in their psychological con- 
text in the earlier articles (Cattell, 
1953 ; Cattell, Blewett, & Beloff, 1955; 
Cattell, Kristy, & Stice, 1957) will 
make the present statements of for- 
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mulae, etc., more meaningful to the 
psychologist. 


PSYCHOLOGICAL THEORY IMPLICIT 
IN THE MAVA MOopDEL 


What has been propounded above 

is not a theory, in the sense of most 
discussed in this Review, but a model 
Burt and Howard (1957), and a few 
others on intelligence, and of Cattell, 
Beloff, and Blewett (1955), and Cat- 
tell, Kristy, and Stice (1957) on per- 
sonality, nothing existed to restrict 
and a method, by which a consider- 
able array of theories can be expressed 
and tested. In this area the vital need 
of psychology has been the initial 
appearance of some dependable rela- 
tionships out of which a theory 
worthy of that name, rather than 
mere speculation, can be born. Apart 
from the work of Burks (1928), 
Burt and Howard (1957), and a few 
others on intelligence, and of Cattell, 
Beloff, and Blewett (1955), and Cat- 
tell, Kristy, and Stice (1957) on per- 
sonality, nothing existed to restrict 
speculators from ascribing anything 
from 0.1% to 99.9% of the variance 
in any human trait to whichever 
influence their Prejudices favored, 

The restrictions which the present 
model puts upon theories (additive 
hereditary and environmental effects, 
interaction of heredity and environ- 
ment by correlation and covariance 
but not statistical interaction) are 
slight compared to the aid it gives in 
checking many theories. Among the 
theories deserving experimental 
vestigation are: 

1. The position adopted in our own 
investigations that the variance ratio 
for any factorially composite piece of 
behavior will be middling but that 
more clear-cut predominance of hered- 
ity or environment will be found for 
factor-pure source traits (Cattell, 
1946). Although it is theoretically 


in- 
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possible, as mooted by Thomson and 
the present writer (1946) that a 
genetic factor (gene) may appear as 
a statistical factor it is almost certain 
that constitutional factors among the 
psychologists factors will _be poly- 
genic. Nevertheless there is a tend- 
ency for factors to be predominantly 
either environmental or hereditary 
(Cattell, Kristy, & Stice, 1957). 

2. The theory that the occurrence 
of environmental influences at par- 
ticular (early) ages, as in Freudian 
theory or in imprinting (Hess, 1959), 
is more effective than at others. In 
animals this theory can be tested by 
manipulative experiment, but in hu- 
mans only by some form of the 
MAVA method applied to groups with 
different histories. In this connection 
an editorial critic has asked, “How 
can one possibly separate maturation 
from learning, age decline, etc., when 
these operate differently at different 
ages?” The answer is that it is pre- 
cisely through, and only through, 
first obtaining these ratios for strategi- 
cally chosen age levels, cultures, etc» 
that further inferences can be made 
regarding the periods of life, or the 
subcultural influences, determining 
the relative action of maturation an 
learning. For example, it was only 
through the discovery of the predomi- 
nantly negative value of the heredity- 
environment correlations, from the 
above experiments, that the "law © 
Coercion to the bio-social mean’ Sa 
6 below) became justified as a worta 
while proposition regarding heredity 
environment interaction. to 

3. The theory that, contrary di- 
Popular thinking, the ratio of ee 
tary influence will be higher with al iy 
Ss. This assumes that younger ahil 
dren will have been exposed Pode 
smaller sample of the total eo 
ment and will experience in this nits 
Posure the greater variance of rhe 
characteristic of smaller samples. 
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hereditary variance remaining un- 
changed, the nature-nurture ratio 
should therefore be greater when 
older Ss are used in the MAVA 
experiment. 

4. A number of interesting theories 
are frequently entertained about the 
effects of family positions and con- 
stellations, of age differences between 
sibs and between children and parents, 
upon personality traits. Such theories 
can be systematically investigated by 
determining the relative environ- 
mental contributions for strategically 
chosen groups of Ss in repeated 
MAVA experiments. 

5. The importance of a general 
“family atmosphere,” e.g., in the 
genesis of neurosis, can be checked 
through the comparison of within- 
family and between-family environ- 
mental variance on the MAVA ap- 
plied to measures of neuroticism 
factors. 

6. A variety of theories needs to be 
tested in regard to the causal direc- 
tions in discovered correlations of 
heredity and environment. Does a 
positive correlation, for intelligence, 
mean that brighter children create a 
richer environment or that for social 
reasons they tend to be born into one? 
This can be answered partly by the 
comparison of Treva and wewa. Does 
the large negative correlation of 
heredity and environment of the ergic 
regression factor, UI 23, (Eysenck's 
“neuroticism factor”) require a theory 
that the neurosis-prone avoid the 
Stresses that might precipitate neu- 
rosis? The general finding that 
hereditary and environmental influ- 
ences tend to be negatively correlated 
with the more substantially inherited 
traits has already been the source ofa 
theory (Cattell, 1957) of “cultural 
Coercion to the bio-social norm.” 

7. A variety of theories in cultural 
anthropology and social psychology 
Can now be examined through a com- 


parison of the MAVA experimental 
results ¿»z different countries. For ex- 
ample, a democratic organization of 
society, providing opportunity for 
all, should raise the importance of 
heredity relative to environment in 
determining, for example, perform- 
ance on intelligence tests. 

8. The model also permits examina- 
tion of theories regarding the effect of 
one trait upon another. A constitu- 
tional endowment of high schizo- 
thymia (A factor), for example, is to 
be considered part of the environment 
for another trait in the same indi- 
vidual, e.g., high intelligence. A 
comparison of MAVA findings on 
intelligence ratios for two types of 
Ss—those preselected to be high and 
low, respectively, in A factor—can test 
and shape initial theories. 

9. For the personality psychologist 
the most important theories of all 
are those concerning the hereditary 
or environmental origins of particular 
personality factors. For example, 
the finding that the H primary in the 
second-order extraversion factor (Cat- 
tell & Scheier, 1960) is largely heredi- 
tarily determined whereas F, sur- 
gency-desurgency, is equally a com- 
plete product of environmental learn- 
ing situations, throws considerable 
new light on extraversion. Also, at 
one stroke, it cuts the theories to 
be investigated regarding H to largely 
physiological ones, and those regard- 
ing surgency to various possibilities 
in learning theory. A lot of theo- 
retical wild goose chases are avoided. 
Equally theoretically provocative is 
the preliminary finding that Comen- 


tion, UI 20, which some have called 


a superego pattern, is actually heavily 


hereditarily determined. 
The above condensed survey may 


suffice to show that the role of MAVA 
findings in both forming and testing 
theoretical developments is consider- 
able, especially in view of the pro- 
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longed dearth of evidence in this ‘== 
and the previous impossibility 0 
formulating results in terms of within- 
and between-family hereditary and 
environmental effects, and the corre- 
lations among them. 
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MOTIVATION AND CUE UTILIZATION IN INTENTIONAL 
AND INCIDENTAL LEARNING* 


DONALD H. KAUSLER °? ann E. PHILIP TRAPP 


University of Arkansas 


In a recent theoretical article, Easter- 
brook (1959) has attempted to relate 
level of motivation to the utilization of 
irrelevant cues in a wide range of be- 
havioral situations. One of these situa- 
tions pertains to the familiar incidental 
learning paradigm in which subject (S) 
is simultaneously exposed to relevant 
and irrelevant cues, Since this proce- 
dure provides no orienting instructions 
to ensure exposure to the irrelevant 
cues, cognizance of irrelevant cues 
would appear to be dependent upon the 
range or span of perception operating 
during performance on the intentional 
task (relevant cues). In general, as 
the range of perception extends beyond 
the central relevant cues, more periph- 
eral cues are utilized, thus permitting 
more irrelevant or incidental cue learn- 
ing; conversely, as the perceptual range 
constricts, fewer peripheral cues are 
utilized and incidental cue learning is 
reduced. According to Easterbrook, 
drive increments produce a restriction 
in range of cue use. This restriction, 
he clearly points out, may facilitate or 
impair the learning of relevant cues, 
depending upon the complexity of these 
cues, but will always impair the learn- 
ing of peripheral irrelevant cues. As 
noted by Easterbrook, a similar posi- 
tion has been taken by Bruner, Matter, 
and Papanek (1955) relating what they 
call “breadth” of learning to both over- 
motivation and overlearning. 

The present article is concerned fur- 
ther with motivation as a variable in 


1 This article was supported in part by a 
grant from the National Science Foundation. 
2 Now at St. Louis University. 


incidental learning. More specifically, 
it is intended to be an evaluation and 
an extension of the position taken by 
Easterbrook that increasing drive level 
during intentional learning reduces the 
efficiency of incidental learning through 
the resulting decrease in irrelevant cue 
utilization, 

However, before proceeding into the 
crux of the problem, some comments 
will be made on methodological pro- 
cedures in incidental-learning experi- 
ments which should help serve to define 
the scope and limitations of the pro- 


posed extension. 


METHODOLOGIES IN INCIDENTAL 
LEARNING EXPERIMENTS 


In experimental studies investigating 
the principles of learning as they apply 
to incidental versus intentional learning, 
two basic methodologies have evolved. 
The first methodology requires a group 
of Ss to learn a specified task inciden- 
tally and, usually, a second group of Ss 
to learn the same task intentionally. 
That is, the intentional group learns the 
task with specific instructions to learn; 
the incidental group, on the other hand, 
is directed toward the task by some 
orienting instructions but is not in- 
structed to learn the task. The impor- 
tant and critical condition in this meth- 
odology is that Ss in the incidental 
group are given orienting instructions 
that assure perception of the learning 
material without an overt set to learn. 
Learning of the incidental material is 
then assumed to indicate the arousal 
of a covert or generalized set to learn. 
This methodology has been the pre- 
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dominant one employed in, the large 
number of recent investigations of in- 
cidental learning (e.g., Brown, 1954; 
Postman & Adams, 1956a, 1956b, 1957, 
1958; Postman, Adams, & Bohm, 1956 : 
Postman, Adams, & Phillips, 1955; 
Postman & Phillips, 1954; Saltzman, 
1953, 1956, 1957). Studies in this 
methodological framework typically ma- 
nipulate systematically either task, pro- 
cedural, or orienting variables in an 
attempt to clarify the nature of the 
processes involved in incidental learn- 
ing. Rarely, however, is motivation 
systematically manipulated within this 
framework. An important exception is 
a study by Spielberger, Goodstein, and 
Dahlstrom (1958) in which level of 
anxiety was investigated as a variable 
in incidental learning. In this study, 
however, only an incidental group was 
used without a corresponding group 
given instructions to learn, 
In the second methodology, each 
group of Ss employed in the experi- 
ment learns both an intentional task 
with instructions to learn and a differ- 
ent incidental task without instructions 
to learn. In other words, in this de- 
sign, S serves as his own control for 
both intentional and incidental learning, 
This methodology lends itself more 
readily to investigating the effects of 
motivational variables on both inten- 
tional and incidental learning than does 
the first methodology. It js apparent 
that the restricting influence of motiva- 
tion formulated by Easterbrook js ap- 
plicable mainly to this methodology, In 
fact, four of the five studies cited by 
Easterbrook (Bahrick, 1954; Kohn, 
1954; Silverman, 1954; Silverman & 
Blitz, 1956) as providing evidence for 
the effects of motivational concentra- 
tion on irrelevant cue utilization in 
human learning are within this frame- 
work. The discussion that follows will 
therefore he restricted primarily to this 
methodology. 
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THE Nature oF MOTIVATION AS A 
VARIABLE IN CUE UTILIZATION 


In his treatment of motivation as a 
variable in cue utilization, Easterbrook 
(1959) is referring specifically to emo- 
tionally based drives. He states: 


it is an empirically derived generalization 
that, when the direction of behavior is con- 
stant, increase in drive is associated | with a 
reduction in the range of cue use. This paper 
is devoted to the expansion and exploration 
of this generalization (pp. 183-184). 


He further states : 


The term drive . . . refers to a dimension 
of emotional arousal or gencral covert P 
citement, the innate response to a state O 
biological deprivation or noxious stimulation, 
which underlies or occurs simultaneously 
with overt action and affects its strength and 
course. This emotional arousal is greater in 
neurotic than in normal subjects, greater 
than usual in subjects under stress or threat 
or in frustration, and in general greater mM 
animals that have been “motivated” by any 
of the usual deprivations, noxious stimula- 
tions, or other incentives than it is in unmo- 


tivated or resting animals of the same species 
(p. 184). 


From these statements it seems aa 
parent that Easterbrook is attempting 
to relate generalized drive (D), as co” 
ceptualized by the Iowa group, to cue 
utilization. 2 

One of the studies cited by Easter- 
brook as evidence for motivational re- 
striction, in human learning, however 
clearly does not attempt to manipulat $ 
generalized D. This is the study y 
Bahrick (1954) in which motivation z 
heightened by offering financial T 5 
wards as an incentive in the high m0” 
tivational condition. Moreover, in for 
lating motivation to perceptual-mo in 
performance, Easterbrook considers nd 
detail a study by Bahrick, Fitts, aoe 
Rankin (1952) in which motivati 
Was again manipulated by financia tëre 
centives. This tendency on Eas na 
brook’s part to equate MonvAN 
states aroused by specific sets with § 
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eralized D seems open to serious crit- 
icism, It is on this point that the 
thinking of the present writers begins 
to differ. 

That the nature of the motivation 
operating during performance on the 
intentional task may be an important 
determiner of irrelevant cue utilization 
is evident from a recent study by 
Kausler, Trapp, and Brewer (1959) in 
which they duplicated the intentional 
and incidental tasks used by Bahrick 
(1954) with groups motivated by a 
generalized D, anxiety. In the Bahrick 
situation, motivation was clearly manip- 
ulated by an incentive-oriented set. He 
found that a high incentive group 
learned a serial list of geometric forms 
(relevant cues) more efficiently than a 
low incentive group. However, the 
high incentive group displayed signifi- 
cantly less incidental learning (irrel- 
evant cues) than the low incentive 
group in the learning of colors filling in 
the geometric forms. These results are 
interpreted by Easterbrook as indicat- 
ing a restriction or funneling of cue 
utilization in the high motivational 
group. The study of Kausler, Trapp, 
and Brewer (1959), however, yielded 
conflicting results. In this study the 
drive condition was generalized emo- 
tional D rather than incentive-oriented 
set. Two separate experiments were 
performed. In the first, anxiety was 
defined by score on the Taylor MAS; 
in the second, anxiety was induced by 
intense ego involving instructions. In 
both experiments, as in the Bahrick 
study, the high drive group did signifi- 
cantly better on the intentional task 
than the low drive group, but, unlike 
the Bahrick study, no difference was 
found between the two groups on the 
incidental task. 

In comparing the Bahrick (1954) 
and Kausler et al. (1959) studies, the 
differential feature was in the nature 
of the motivation employed, incentive- 


oriented set versus generalized D. 
Farber (1955) has emphasized that 
nongeneralized motives have directive 
aspects but do not have dynamogenic 
properties. He states: 


The supposition that motivational variables 
have steering (associative) properties as 
well as energizing and reinforcing (drive) 
properties does not, of course, imply that the 
two kinds of properties are identical. Indeed, 
the virtue of the foregoing analyses of the 
concept of motivation lies in their separation 
of these functions. The associative function 
of a motive or any other variable is identified 
in terms of its tendency to elicit or facilitate 
a limited class of responses only. The drive 
function of a variable is demonstrated if: 
(a) its presence energizes or intensifies in- 
discriminately all reaction tendencies existing 
in a given situation; and/or (b) its elimina- 
tion or reduction in magnitude is reinforc- 
ing, i.e., leads to the increased probability of 
recurrence, in the same situation, of the re- 
sponses preceding its modification. Motives, 
it is presumed, have both functions, and ifa 
variable clearly does not have both, its status 
as a motive is questionable (pp. 311-312). 


Thus, incentive-oriented sets would 
seem to possess associative properties 
without drive properties. That is, they 
elicit particular responses that are goal 
directed but do not energize other re- 
sponses that are irrelevant to the goal. 
In general, sets of this nature are ex- 
perimentally induced by encouraging 
efficient performance on the relevant 
task without intentionally arousing emo- 
tional responses. This is typically ac- 
complished by means of rewards or by 
task-oriented instructions. Generalized 
D, on the other hand, serves an energiz- 
ing function in that its presence intensi- 
fies indiscriminately all reaction ten- 
dencies existing in a given situation, 
both relevant and irrelevant to the cen- 
tral task. In general, D is experi- 
mentally manipulated by the intentional 
arousal of emotional responses during 
the learning process. This is typically 
accomplished by differentiating between 
Ss on some measure of intrinsic anxiety 
or by inducing anxiety through the use 
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of ego involving instructions, failure, 
ess, etc. F 
a It is conceivable that in both the 
Bahrick and Kausler et al. studies in- 
crements in motivation, either associa- 
tive or drive, resulted ina reduction of 
habit strength for the irrelevant cues 
through the restriction of perceptual 
range proposed by Easterbrook. Thus, 
the high motivational groups acquired 
less irrelevant habit strength than the 
low motivational group. In the Bah- 
rick study this difference in habit 
strength is manifested because the dif- 
ference in motivation between the two 
groups is relevant only to the inten- 
tional task and, therefore, does not 
enter into the multiplicative relationship 
(cf. Spence, 1958a, 1958b) between 
irrelevant habit strength and drive level. 
In the Kausler et al, study the gen- 
eralized D would be expected to inten- 
sify the relatively weaker habit strength 
in the high drive group, thus bringing 
its response potential to a point com- 
parable to that in the low drive group 
where habit Strength is stronger but 
drive level is weaker, Hence, the 


groups would not differ in incidental 
learning. 


POSITIONING oF RELEVANT AND 
IRRELEVANT CUES 


The results of the Kausler et al. 
study also conflict with the studies by 
Silverman (1954) and Silverman and 
Blitz (1956) cited by Easterbrook as 
yielding supporting evidence for the 
restrictive influence of high motivation. 
In all three studies motivation was 
manipulated by an emotionally based 
generalized D. In the Silverman study 
anxiety was induced by threat of shock, 
and in the Kausler et al. study it was 
induced by ego involving instructions, 
In both the Silverman and Blitz and 
Kausler et al. studies intrinsic anxiety 
was employed by selecting extreme 
scorers on the MAS. 
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In both Silverman studies an increase 
in anxiety level was followed by the 
expected decrease in incidental learn- 
ing in constrast to the no difference re- 
ported in the Kausler et al. study. Al- 
though the drive state does not differ in 
the three studies, there is another im- 
portant differential between them. This 
is in terms of the relationship of the 
irrelevant cues to the relevant cues 
within the perceptual field. In the 
Kausler et al. study, as in the Bahrick 
study, the irrelevant cues are embodied 
within the relevant cues (colors within 
geometric forms), thus assuring close 
proximity, both spatially and tempo- 
rally, between relevant and irrelevant 
cues. Consequently, even though anx- 
ious Ss may have a restricted percep- 
tual range, the irrelevant cues may well 
fall within that range. In contrast, the 
relevant and irrelevant cues in the Sil- 
verman studies are widely spaced within 
the perceptual field. In fact, in the 
Silverman (1954) study, the relevant 
cues are perceived by a different sense 
modality (vision) than are the irrel- 
evant cues (audition). In the Silver- 
man and Blitz (1956) study, the irrele- 
vant cues (two-digit numbers) occupied 
a different section of a memory drum 
opening than did the relevant cues 
(nonsense syllables), and were sepa- 
rated from the relevant cues by six 
centimeters, It is very possible, there- 
fore, that in the Kausler et al. study the 
differential perception between high 
and low anxious groups was not as 
Pronounced as in the Silverman studies. 

It is conceivable then that the amount 
of incidental learning under high and 
low motivation is affected differentially 
by position of the irrelevant cues within 
the perceptual field, That is, there Me 
an interaction effect between level © 
motivation and cue positioning. In 
general, as the irrelevant cues are 
moved toward the periphery of the per- 
ceptual field, incidental learning will de- 
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crease, but the reduction will be greater 
for the high motivational levels than for 
low. Considerable systematic investi- 
gation is needed to establish the validity 
of this proposed hypothesis. 


TASK AND PROCEDURAL VARIABLES 


Other important variables to take 
into account in evaluating the effects 
of increased motivation on intentional 
and incidental learning are variables re- 
lating to the nature of the intentional 
task and method of presenting the in- 
tentional task. Particularly important 
variables would seem to be variables re- 
lating to the difficulty of the intentional 
task, such as, length, intratask com- 
petition, rate of presentation, etc. A 
number of studies employing the first 
methodology discussed earlier have sys- 
tematically investigated these variables 
and demonstrated their importance in 
incidental learning studies (e.g., Nei- 
mark & Saltzman, 1953; Postman & 
Adams, 1957; Postman, Adams, & 
Phillips, 1955). 

Differences in the tasks used, both 
intentional and incidental, make it more 
arduous to compare the results of stud- 
ies with the simultaneous relevant-irrel- 
evant cue methodology. This is espe- 
cially true when motivation is varied 
qualitatively from study to study. 
Kausler and Trapp (1959) have sug- 
gested that studies making cross-drive 
comparisons should use, whenever pos- 
sible, the same task in order to control 
for task effects. 

Some evidence that the efficiency of 
incidental learning under different lev- 
els of motivation may be related to task 
difficulty is indicated in the study of 
Spielberger, Goodstein, and Dahlstrom 
(1958). Using the first methodology 
but without an intentional group, they 
found that the relationship between 
anxiety level (as defined by score on 
the MAS) and incidental learning 


(learning the designs on the Bender- 
Gestalt test) was a function of the 
difficulty of the learning material. On 
the easy designs the high anxious Ss 
performed at a higher level than the 
low anxious Ss; on the difficult designs 
the low anxious Ss performed at a 
higher level than the high anxious Ss. 
The present writers believe that task 
difficulty variables operate in the inten- 
tional-incidental learning situation by 
altering the degree of concentration re- 
quired of Ss. Rate of presentation, for 
example, would be important in that 
the faster the rate, the greater the de- 
gree of concentration demanded. It is 
here suggested that intentional task var- 
iables which alter the degree of concen- 
tration required of S will affect further 
the range of attention for simultane- 
ously present irrelevant ones. How- 
ever, the net effect on the learning of 
irrelevant cue habits should also be a 
function of the nature of the motivation 
present (incentive-oriented versus gen- 
eralized D), the level of motivation, and 
the positioning of irrelevant cues within 
the perceptual field. Thus studies re- 
lating degree of motivation to irrelevant 
cue utilization should also take into ac- 
count the nature of the drive, the nature 
of the intentional task, and the position- 
ing of the irrelevant cues. . 
For example, on easy tasks which 
require little concentration, high gen- 
eralized drive Ss should display more 
incidental learning than low drive Ss 
(as in the Spielberger et al., 1958, 
study). The perceptual range should be 
fairly broad under these conditions and 
should incorporate irrelevant cues within 
close proximity to the relevant cues. 
Since generalized D reacts multiplica- 
tively with irrelevant habit strength, the 
reaction tendency for high drive Ss 
should be greater than for low drive Ss. 
On difficult tasks, however, the percep- 
tual range for irrelevant cues should be 


much narrower. Consequently, when 


378 Donatp H. KAUSLER 
combined with the further restricting 
sonbini high drive, the attention ir 
these cues and subsequent habit streng ; 
should be considerably lower for high 
drive Ss than for low drive Ss. Under 
these conditions, low drive Ss should 
begin to display superior incidentai 
learning. ’ 4 
With incentive-oriented motivation, 
however, low motivation might be ex- 
pected to result in superior incidental 
learning regardless of intentional task 
difficulty. This is likely because the 
motivation involved does not enter into 
a multiplicative relationship with habit 
strength. Nevertheless, the difference 
in incidental learning Proficiency be- 
tween high and low incentive groups 
could be expected to diminish as the 
intentional task decreases in difficulty. 
That is, the highly motivated Ss would 
need to concentrate less on easy tasks, 
thus reducing the funneling effect on 
their attention to irrelevant cues; con- 
versely, on difficult tasks requiring in- 
tense concentration, the funneling effect 
of high incentive motivation should be 
most pronounced, 

It is obvious that considerab 
tematic research is needed, ma 
ing task variables along with motiva- 
tional and positional variables, before 
the overall effects of level of motivation 
on irrelevant cue utilization can be de- 
termined. It does seem clear, however, 
that the phenomena Easterbrook seeks 
to embrace are too complex to be uni- 
fied under one general principle, appeal- 
ing as the thought may be. Even at 
this stage, the empirical data demand 
some extensions to his formulations. 
The elaborations presented here do 
reconcile existing conflicting findings, 
How well they can handle derivable 
hypotheses, like those suggested in the 
foregoing, remains to be seen. Studies 
to that effect are currently in process in 

our laboratories. 


le sys- 
nipulat- 
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SUMMARY 


Easterbrook has proposed that in- 
creasing motivation (specifically defined 
as emotional D) results in a funneling 
or focusing of attention to relevant cues 
when Ss are simultaneously exposed to 
relevant and irrelevant cues. The pres- 
ent article suggests that the net effects 
of increasing motivation on cue utiliza- 
tion during incidental learning are func- 
tions of additional variables as well. Of 
particular importance seem to be the 
nature of the motivation (generalized 
D versus incentive-oriented set), posi- 
tion of the irrelevant cues relative to the 
relevant cues (central versus periph- 
eral), and difficulty of the intentional 
task. This extension of Easterbrook’s 
framework brings on a rapprochement 
with existing research findings and 
opens the way to more systematic 1n- 
vestigations into an admittedly complex 
area. 
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E RETENTION 
VARIATION AND SELECTIV 
ee CREATIVE THOUGHT AS IN OTHER 


KNOWLEDGE 


PROCESSES? 


DONALD T. CAMPBELL 


Northwestern 


i aper proposes to examine 
ative fought wihin the framework 
of a comparative psychology of knowl- 
edge processes, and in particular with 
regard to one theme recurrent in most 
knowledge processes, This theme may 
be expressed as follows : 


1. A blind-variation-and- 
process is fundamental to all inductive 
achievements, to all genuine increases in 
knowledge, to all increases in fit of system 
to environment. 

2. The many Processes which shortcut a 
more full blind-variation-and-selectiye-reten. 
tion process are in themselyes inductive 
achievements, containing wisdom about the 
environment achieved originally by blind 
variation and selective retention, 

3. In addition, such 
contain in their own 
variation-and-selective-re 
some level, substituting 
exploration or the life 
of organic evolution, 


selective-retention 


shortcut Processes 
Operation a blind- 
tention process at 
for overt locomotor 
~and-death winnowing 


Between a modern experimental 
physicist and some virus-type ancestor 
there has been a tremendous gain in 
knowledge? about the environment, 


1A partially overlapping version of this 
paper was presented at the Inter-Discipli- 
nary Conference on Self-Organizing Systems, 
sponsored by the Office of Naval Research 
and the Armour Research Foundation of the 
Illinois Institute of Technology, Chicago, 
May 5-6, 1959. The proceedings of the con. 
ference are to be published by Pergamon 
Press under the title Self-Organizing Sys- 
tems. The author is indebted to Carl p, 
Duncan for contributing to the development 
of many of the points involved. ae 

? This extended usage of “knowledge” is 
a part of an effort to put “the problem of 
knowledge” into a behavioristic framework 
which takes full cognizance of man’s Status 
as a biological product of an evolutionary 
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University 


In bulk, this has represented cumulated 
inductive achievements, stage by stage 
expansions of knowledge beyond what 
could have been deductively derived 
from what had been previously known. 
It has represented repeated “breakouts 
from the limits of available wisdom, for 
if such expansions had represented 
only wise anticipations, they would 
have been exploiting full or partial 
knowledge already achieved, Instead, 
Se 


development from a highly limited back- 
ground, with no “direct” dispensations of 
knowledge being added at any point in the 
family tree. The bibliographical citation 
of the several sources converging on this 
approach to the problem of knowledge, and 
the discussion of its relation to traditional 
Philosophical issues and to the strategy of 
science are presented elsewhere (Campbell, 
1959). Suffice it to say here that the posi- 
tion limits one to “an epistemology of the 
other one.” The “primitives” of knowledge 
san not be sought in “raw feels” or on. 
“phenomenal givens,” or in any “incorrigible 
elements. While man’s conscious knowl- 
edge Processes are recognized as more com- 
plex and subtle than those of lower organ- 
isms, they are not taken as more funda- 
mental or primitive. In this perspective, 
any process Providing a stored program for 
Organismic adaptation in external environ- 
ments is included as a knowledge proc 
and any gain in the adequacy of such a 
Program is regarded as a gain in knowledge: 

the reader prefers, he can See 
the paper adequately regarding the Tied 
“knowledge” as metaphorical when app E 
to the lower levels in the developmen 1- 
hierarchy. But since the problem of at 
edge has resisted any generally accep a 
solution when defined in terms of the welt 
Scious contents of the philosopher Hi THe 
little seems lost and possibly something 
gained by thus extending the range 
Processes considered. 
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real gains must have been the products 
of explorations going beyond the limits 
of foresight or prescience, and in this 
sense blind. In the instances of such 
real gains, the successful explorations 
were in origin as blind as those which 
failed. The difference between the 
successful and unsuccessful was due to 
the nature of the environment en- 
countered, representing discovered wis- 
dom about that environment. 

The general model for such induc- 
tive gains is that underlying both 
trial-and-error problem solving and 
natural selection in evolution, the 
analogy between which has been 
noted by several persons (e.g., Ashby, 
1952; Baldwin, 1900; Pringle, 1951). 
Three conditions are necessary: a 
mechanism for introducing variation, a 
consistent selection process, and a 
mechanism for preserving and repro- 
ducing the selected variations. In what 
follows we shall look for these three 
ingredients at a variety of levels. But 
first a comment on the use of the word 
“blind” rather than the more usual 
“random.” It seems likely that Ashby 
(1952) unnecessarily limited the gen- 
erality of his mechanism in Homeostat 
by an effort to fully represent all of 
the modern connotations of random. 
Equiprobability is not needed, and is 
definitely lacking in the mutations 
which lay the variation base for organic 
evolution. Statistical independence be- 
tween one variation and the next, while 
frequently desirable, can also be spared : 
in particular, for the generalizations 
essayed here, certain processes involv- 
ing systematic sweep scanning are 
recognized as blind, insofar as varia- 
tions are produced without prior knowl- 
edge of which ones, if any, will furnish 
a selectworthy encounter. An essential 
connotation of blind is that the varia- 
tions emitted be independent of the en- 
vironmental conditions of the occasion 
of their occurrence. A second important 


connotation is that the occurrence of 
trials individually be uncorrelated with 
the solution, in that specific correct 
trials are no more likely to occur at any 
one point in a series of trials than an- 
other, nor than specific incorrect trials. 
(Insofar as observation shows this not 
to be so, the system is making use of 
already achieved knowledge, perhaps 
of a general sort. The prepotent re- 
sponses of an animal in a new puzzle 
box toward the apparent openings may 
thus represent prior general knowledge, 
transferred from previous learning or 
inherited as a product of the mutation 
and selective survival process.) A 
third essential connotation of blind is 
rejection of the notion that a variation 
subsequent to an incorrect trial is a 
“correction” of the previous trial or 
makes use of the direction of error of 
the previous one. (Insofar as mech- 
anisms do seem to operate in this 
fashion, there must be operating a 
substitute process carrying on the blind 
search at another level, feedback cir- 
cuits selecting “partially” adequate 
variations, providing information to the 
effect that “you're getting warm,” etc.) 


Review oF THE THEME IN LOWER 
KNOWLEDGE PROCESSES 


In this. perspective, the epistemo- 
logically most fundamental knowledge 
processes are embodied in those several 
inventions making possible organic 
evolution. At the already advanced 
level of cellular life, this is a “learning” 
on the part of the species by the blind 
variation and selective survival of 
mutant individuals. In terms of the 
three requirements, variation is pro- 
vided by the mutations, selection. by 
the somewhat consistent or “‘knowable” 
vagaries of the environment, and pres- 
ervation and duplication by the com- 
plex and rigid order of chromosome 
mitosis. Bisexuality, heterozygosity, 
and meiotic cell division represent a 
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secondary invention increasing the cr 
ficiency of the process through increas- 
ing the range of variation and the rate 
of readjustment to novel environments. 
The selection and preservation proc- 
esses remain the same. The ubiquity 
of bisexuality, its several independent 
inventions, and the multifarious elab- 
oration of the theme, all speak to its 
tremendous usefulness. 
The higher evolutionary develop- 
ments shift a part of the locus of 
adaptation away from a trial and error 
of whole organisms or gene pools, 
over to processes occurring within the 
single organism. Such processes are 
numerous, each being not only a device 
for obtaining knowledge, but also 
representing general wisdom about 
environmental contingencies already 
achieved through organic evolution, 
making possible more efficient achieve- 
ment of detailed local knowledge. One 
of the most primitive of these is ex- 
ploratory locomotion, described in the 
protozoa by Jennings (1906) and ac- 
cepted as a model for Homeostat by 
Ashby (1952). Forward locomotion 
persists until blocked, at which point 
direction of locomotion is varied blindly 
until unblocked forward locomotion is 
again possible. The external physical 
environment is the selection agency, the 
preservation of discovery is embodied 
in the perseveration of the unblocked 
forward movement. At this level, the 
species has “discovered” that the en- 
vironment is discontinuous, consisting 
of penetrable regions and impenetrable 
ones, and that impenetrability js to 
some extent a stable characteristic—jt 
has discovered that when blocked jt 
is a better strategy to try to go around 
than to wait until one can move 
through. . 
Insofar as individual organisms with- 
out distance receptors (such as para- 
mecia and earthworms) can learn 
through contiguity, the species has al- 
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ready achieved the more general knowl- 
edge that there is some event-contin- 
gency stability in the environment. 
That is, in the degree to which indi- 
vidual learning is useful, there has 
been the  species-level discovery of 
slower transformation processes on the 
part of relevant segments of the envi- 
ronment than of the organism, In ad- 
dition, whereas the ultimate selection 
is life or death in encounters with the 
external environment, by the evolu- 
tionary stage at which learning is pos- 
sible, much of this once-external crite- 
tion has been internalized. Crude 
environmental contingencies with low 
selection ratios are now represented 
as pleasures or pains, or as reinforcers 
more generally. The selection be- 
comes much more sharp, but the con- 
tact with the environmental realities 
less direct, 

The presence of a fundamental trial- 
and-error process in individual learn- 
ing needs no elaboration or defense. 
Suffice it to say that recognition of 
such a process ig found in all learning 
theories which make any pretense of 
completeness, including at least three 
of Gestalt inspiration (Campbell, 
1956a). While higher vertebrate (and 
higher cephalopod ) learning makes far 
more use of the short circuiting of 
overt trial and error by vision than is 
allowed for by the usual learning 
theory (Campbell, 1956b), for con- 
venience here the multiplication of 
levels will be avoided by treating trial- 
and-error learning as a single process 
level, 

The next and most striking class of 
discoveries are those centering around 
echo-location and vision. Woodworth 
(1921) has emphasized the achieve- 
ment of a percept from the ingredients 
of sensation through a series of “trial- 
and-error perceptions.”  Thurstone 
(1924) has interpreted perception as 
a trial and error of potential locomo- 
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tions placed in a hierarchy of trial- 
and-error processes including both 
overt trial and error and ideational 
trial and error, in a book containing 
many anticipations of cybernetic con- 
cepts. Pumphrey (1950) interprets 
the primitive sense receptor of the 
fishes called the “lateral-line organ” as 
a crude echo-location device, making 
use of the reflected pulses of the fish's 
own swimming. Griffin (1958) has 
documented in detail the use by bats 
and cave birds of sonic and supersonic 
vocalizations selectively reflected by 
obstacles of the environment. Kellogg 
(1958) has made a similar case for 
the porpoise. Here is a powerful 
substitute for blind locomotor explora- 
tion. (See Simon, 1957, p. 264, for 
an estimate of such gains.) In echo 
location a wave pulse is emitted blindly 
in all directions. The obstacles of the 
environment selectively reflect the 
pulse from certain of these directions, 
and thus provide a feedback which is 
substitutable for that which would 
have been received had the animal 
locomoted in those directions. Radar 
guidance systems employ an analogous 
substitution of a blindly scanning elec- 
tromagnetic wave pulse, in economical 
substitution for a blind scanning of 
the same environment of potential 
locomotions by full ship or projectile 
movements. 

Visual perception seems interpret- 
able as a substitute search process of 
similar order (Campbell, 1956b). The 
full analogy is weakened by the absence 
of an emitting process on the part of 
the organism. Instead, advantage is 
taken of diffuse electromagnetic waves 
made available from external sources. 
Consider first a pseudoeye consisting 
of but a single photoreceptor cell. 
(Such a device has been distributed 
for use by the blind in which a photo- 
cell output is transformed into a sound 
of variable pitch.) With such a device, 


blind scanning as in a radar system is 
essential. Brightness contours can be 
located and fixated by continual cross- 
ing, as in the “hunting” process in a 
mechanical servosystem, or as in the 
vocal pitch control in which a steady 
note is “held” only by a continuous 
search oscillation (Deutsch & Clarkson, 
1959). To conceive of such an “eye” 
as a blind searching device substituting 
for a more costly blind locomotion in 
the explored directions is not difficult. 
The eyes of insects and vertebrates and 
the higher cephalopods differ from 
such a device by having multiple photo- 
cells, making possible selective reflec- 
tion from objects in multiple directions 
at once. Each receptor cell can be 
conceived of as exploring the pos- 
sibilities of locomotion in a given direc- 
tion, the retina collectively thus ex- 
ploring the possibilities of locomotion 
in a wide segment of potential direc- 
tions for locomotion. Except as the 
eye is aimed by other sources of knowl- 
edge, these possibilities have been 
made “blindly” available without pre- 
science or insight. For the “blindness” 
of an eyeless animal there has been 
substituted a process so efficient that 
we use it naively as a model for 
direct, unmediated knowing. But the 
process is still one of blind search and 
selective retention, in the sense em- 
ployed in this paper. 

Vision is a very complex and mar- 
velous mechanism, and the brief pres- 
entation here does not do justice even 
to the random search components 1n- 
volved. Hebb (1949) has well docu- 
mented the active search of eye move- 
ments, correcting the model of the 
inactive fixed-focus eye which is im- 
plicit in both Gestalt psychology and 
conditioning theory. — Riggs (Riggs, 
Armington, & Ratliff, 1954) and 
Ditchburn (1955) have documented 
the essential role of the continuous low 
amplitude scanning provided by “phys- 
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iological nystagmus fore sata 
tremor.” Platt (1958) has p blind 
brilliant analysis of the role of a ind 
Pe De Mis enn enna 
model, for the achievement of visual 
acuity and spatial representation in a 
visual system containing unaddressed 
elements. These and other considera- 
tions convince the present writer that 
although vision represents the strongest 
challenge to the generality a blind- 
variation-and-selective-retention aspect 
to all knowledge processes, it is not in 
fact an exception. These brief com- 
ments have not fully justified this con- 
clusion, however. 

Taking these echo location and visual 
exploratory processes collectively, sey- 
eral general aspects can be noted: all 
exploit a specific and limited coin- 
cidence, i.e., that objects impenetrable 
by organismic locomotion also are 
opaque to, or reflect, certain wave 
forms in the acoustical frequencies and 
in the bands of electromagnetic waves 
of the visual and radar spectra. It is 
this coincidence, unpredictable upon 
the basis of the prior knowledge avail- 
able to the more primitive organisms, 
which makes possible such marvelously 
efficient shortcuts. Thus while phe- 
nomenologically vision is more direct 
than other knowledge processes, it is 
seen in this perspective as an indirect, 
substitute process. As in all sub- 
stitute knowledge processes, the ef- 
fectiveness is limited by the accuracy 
of the coding process, i.e., the transla- 
tion terms between one level and 
another. Such coding is never ex- 

haustive (Platt, 1956). It always in- 
volves abstraction, and along with this 
some fringe imperfection and pronenesg 
to systematic error. It must finally 
be checked out and corrected by overt 
locomotion. Its efficacy is limited by 
the relevance of the coding to the more 
fundamental level of behavior for which 
it is a substitute. This relevance was 
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itself initially tested out by a blind- 
variation-and-selective-retention proc- 
ess at the level of organic evolution or 
early childhood learning. (Species 
differ in this regard.) The phenomenal 
directness of vision tempts us to make 
vision prototypic for knowing at all 
levels, and leads to that chronic belief 
in the potential existence of direct and 
“insightful” mental processes, a belief 


which it is one purpose of this paper 
to deny. 


Creative THOUGHT 


Creative thought provides the next 
level knowledge process for the present 
discussion. At this level there is a 
substitute exploration of a substitute 
representation of the environment, the 
“solution” being selected from the 
multifarious exploratory thought trials 
according to a criterion which is in 
itself substituting for an external state 
of affairs, Insofar as the three sub- 
stitutions are accurate, the solutions 
when put into overt locomotion are 
adaptive, leading to intelligent behavior 
which lacks overt blind floundering, 
and is thus a knowledge process. To 
include this process in the general plan 
of blind-variation-and-selective-reten- 
tion, it must be emphasized that insofar 
as thought achieves innovation, the 
internal emitting of thought trials one 
by one is blind, lacking prescience of 
foresight. The process as a whole of 
Course provides “foresight” for the 
Overt level of behavior, once the proc- 
€ss has blindly stumbled into a thought 
trial that “fits” the selection crite- 
tion, accompanied by the “something 
clicked,” “Eureka,” or “aha-erlebnis 
that usually marks the successful tet- 
mination of the process. el 

Today, we find the blind-variation 
and-selective-retention model most plau- 
sibly applied at the levels of organic 
evolution and trial-and-error learning 
of animals, and least palatable as 4 
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description of creative thinking. His- 
torically, however, the phrase “trial 
and error” was first used to describe 
thinking, by Alexander Bain as early 
as 1855, two years before Darwin’s 
publication of the doctrine of natural 
selection. Not only for historical in- 
terest, but also to further develop the 
psychology of creativity, the following 
quotations from him (Bain, 1874) are 
provided : 


Possessing thus the material of the con- 
struction and a clear sense of the fitness or 
unfitness of each new tentative, the operator 
proceeds to ply the third requisite of con- 
structiveness—trial and error— . . . to 
attain the desired result. . . . The number 
of trials necessary to arrive at a new 
construction is commonly so great that 
without something of an affection or fascina- 
tion for the subject one grows weary of the 
task. This is the emotional condition of 
originality of mind in any department 
(p. 593). 

In the process of Deduction . . . the 
same constructive process has often to be 
introduced. The mind being prepared be- 
forehand with the principles most likely 
for the purpose . . . incubates in patient 
thought over the problem, trying and re- 
jecting, until at last the proper elements 
come together in the view, and fall into 
their places in a fitting combination (p. 594). 

With reference to originality in all de- 
partments, whether science, practice, or 
fine art, there is a point of character that 
deserves notice. I mean an Active 
turn, or a profuseness of energy, put forth 
in trials of all kinds on the chance of 
making lucky hits . . . Nothing less than 
a fanaticism of experimentation could have 
given birth to some of our grandest practical 
combinations. The great discovery of 
Daguerre, for example, could not have 
been regularly worked out by any systematic 
and orderly research; there was no way but 
to stumble upon it. . . . The discovery is 
unaccountable, until we learn that the 
author . . . got deeply involved in trials 
and operations far removed from the beaten 


paths of inquiry (P. 595). 


In 1881 Paul Souriau presented a 
still more preponderant emphasis on 


the factor of chance as the sole source 


of true innovation. He asserts again 


and again that “le principe de l’inven- 
tion est le hasard.” In the main, ` 
he presents his argument through il- 
lustration and through the elimination 
of rival hypotheses about the inventive 
process, including deduction, induction, 
and “la methode.” A positive explana- 
tion of the process is hard to find. 
This sample will illustrate his ap- 


proach: 


A problem is posed for which we must 
invent a solution. We know the condi- 
tions to be met by the sought idea; but we 
do not know what series of ideas will lead 
us there. In other words, we know how the 
series of our thoughts must end, but not how 
it should begin. In this case it is evident 
that there is no way to begin except at 
random. Our mind takes up the first path 
that it finds open before it, perceives that 
it is a false route, retraces its steps and 
takes another direction. Perhaps it will 
arrive immediately at the sought idea, 
perhaps it will arrive very belatedly: it 
is entirely impossible to know in advance. 
In these conditions we are reduced to 
dependence upon chance. 

In the case just analysed we supposed 
that we had to solve a problem already 
stated for us. But how was the problem- 
statement itself found? It is said that a 
question well posed is half answered. If 
so, then true invention consists in the 
posing of questions. There is something 
mechanical, so to speak, in the art of find- 
ing solutions. The truly original mind is 
that which discovers problems. But here 
again, it does no good to speak of method, 
since method is the application of already 
existing discoveries. The discovery of a 
new problem can therefore only be fortui- 
tous. Thus we see the role of logic diminish 
and that of chance increase as we approach 
closer to true invention. Chance is the 
first principle of invention: it is what has 
produced method, nourished it, and made it 
fertile. Method can only analyse the ideas 
which come to it from elsewhere, drawing 
out their consequences and exhausting their 
contents, Left to itself method soon be- 
comes sterile. Methodological minds can- 
not help having a feeling of disdain for 
adventurous minds which affirm before proy- 
ing and believe before knowing. But they 

that without such audacity, 


must recognize It i 
no progress would be possible. The mind 
S 


is not able to revise itself upon its own 
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foundations. New ideas cannot have pole 
types: their appearance can only be at- 
tributed to chance (pp. 17-18). 


Souriau has not only the notion of 
chance combinations, but also the con- 
cept of their being produced in large 
numbers which are generally worthless 
and from which only the rare ones 
fitting a goal or criterion are selected. 
These two widely separated quotations 
illustrate this: 


By a kind of artificial selection, we can 
in addition substantially perfect our thought 
and make it more and more logical. Of 
all of the ideas which present themselves 
to our mind, we note only those which 
have some value and can be utilized in 
reasoning. For every single idea of a 
judicious and reasonable nature which of- 
fers itself to us, what hosts of frivilous, 
bizarre, and absurd ideas cross our mind. 
Those persons who, upon considering the 
marvelous results at which knowledge has 
arrived, cannot imagine that the human 
mind could achieve this by a simple fum- 
bling, do not bear in mind the great number 
of scholars working at the same time on 
the same problem, and how much time even 
the smallest discovery costs them. Even 
genius has need of patience. It is after 
hours and years of meditation that the 
sought-after idea Presents itself to the in- 
ventor. He does not succeed without going 
astray many times; and if he thinks him- 
self to have succeeded without effort, 
only because the joy of having succeeded 
has made him forget all the fatigues, all 
of the false leads, all of the agonies, with 
which he has paid for his success (p. 43). 

. .. If his memory is strong enough to 
retain all of the amassed details, he evokes 
them in turn with such rapidity that they 
seem to appear simultaneously; he groups 
them by chance in all the possible Ways; 
his ideas, thus shaken up and agitated in 
his mind, form numerous unstable ag- 
gregates which destroy themselves, | and 
finish up by stopping on the most simple 
and solid combination (pp. 114-115). 


, it is 


The phrase “artificial selection” is 
reminiscent of Darwin’s writings, al- 
though Souriau makes no mention of 
the selective-survival model of evolu- 
tion, (nor does he cite the ideas of 
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any other person whatsoever ) ie Note 
how similar the final quotation is to 
Ashby’s (1952) phrasing of the inev- 
itable self-elimination of unstable com- 
binations : 


Just as, in the species, the truism that the 
dead cannot breed implies that there is a 
fundamental tendency for the successful to 
replace the unsuccessful, so in the nervous 
system does the truism that the unstable 
tends to destroy itself imply that there is 
a fundamental tendency for the stable to 
replace the unstable (p. vi). 


Ernst Mach was another great 19th 
century thinker about thinking who 
emphasized this model. We today 
remember him most as a psychologist- 
physicist-philosopher who contributed 
to the present day positivistic recogni- 
tion of the hypothetic character of our 
constructions of the world and who 
first made explicit the empirical pre- 
sumptions involved in the physicist’s 
assumption of an Euclidian space. But 
When, at the age of 57 in 1895, he 
was called back to his alma mater the 
University of Vienna to assume a 
newly created position of Professor of 
the History and Theory of Inductive 
Science, he chose a quite different 
theme for his inaugural address. His 
title was “on the part played by ace 
cident in invention and discovery- 
The occasion indicates the importance 
he gave to the message, and indeed, 


8 Souriau’s Presentation is in general quite 
modern in spirit, although associationisti¢ 
in a way some would find dated, an 
vigorously deterministic in a way now hie 
mined by subatomic physics, although no 
necessarily so for the problems of whic 
he treats. He comments wisely on many 
topics not covered here, including simulta- 
neous independent invention and the Zeit- 
geist, the social conditions of creativity a" 
invention, the dissonance created by dis- 
crepant opinions of others, the congrui 
of free will and determinism, and both di- 
conflict and interdependency between erud h 
tion and innovation, His attacks on bot 
deduction and induction are reminiscent © 

eirce’s later critiques. 
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his paper is a neglected classic in the 
psychology of knowledge processes. 
These quotations (Mach, 1896) fur- 
ther reinforce the model of creative 
thought being presented : 


The disclosure of new provinces of facts 
before unknown, can only be brought about 
by accidental circumstances . . . (p. 168). 

In such [other] cases it is a psychical 
accident to which the person owes his dis- 
covery—a discovery which is here made 
“deductively” by means of mental copies 
of the world, instead of experimentally 
(p. 171). 

After the repeated survey of a field has 
afforded opportunity for the interposition 
of advantageous accidents, has rendered all 
the traits that suit with the word or the 
dominant thought more vivid, and has 
gradually relegated to the background all 
things that are inappropriate, making their 
future appearance impossible; then from the 
teeming, swelling host of fancies which a 
free and high-flown imagination calls forth, 
suddenly that particular form arises to 
the light which harmonizes perfectly with 
the ruling idea, mood, or design. Then it 
is that that which has resulted slowly as 
the result of a gradual selection, appears 
as if it were the outcome of a deliberate 
act of creation. Thus are to be explained 
the statements of Newton, Mozart, Richard 
Wagner, and others, when they say that 
thoughts, melodies, and harmonies had 
poured in upon them, and that they had 
simply retained the right ones (p. 174). 


Poincaré (1908, 1913) in his famous 
essay on mathematical invention pre- 
sents a point of view which is also 
judged to be in agreement. He first 
gives an example in imagery: “One 
evening, contrary to my custom, I 
drank black coffee and could not sleep. 
Ideas rose in crowds; I felt them 
collide until pairs interlocked, so to 
speak, making a stable combination” 
(Poincaré, 1913, p. 387). Poincaré 
feels that it is rare for this blind 
o rise into con- 


permuting process t 
de d that as a rule 


scious awareness, an r 
only the successful selected alternatives 
enter consciousness. Because of the 
relevance of Poincaré’s comments; be- 


cause Hadamard (1945) has cited him 
in opposition to the accidentalist posi- 
tion while he is read here as favoring 
the selective-retention version of it; 
and because of all of the sources cited 
he would most generally be respected 
as truly creative (in the field of math- 


ematics) these longish excerpts (Poin- 
caré, 1913) are read into the record : 


It is certain that the combinations which 
present themselves to the mind in a sort 
of sudden illumination, after an unconscious 
working somewhat prolonged, are generally 
useful and fertile combinations, which seem 
the result of a first impression. Does it 
follow that the subliminal self, having 
divined by a delicate intuition that these 
combinations would be useful, has formed 
only these, or has it rather formed many 
others which were lacking in interest and 
have remained unconscious? 

In this . . . way of looking at it, all the 
combinations would be formed in con- 
sequence of the automatism of the sub- 
liminal self, but only the interesting ones 
would break into the domain of conscious- 
ness. And this is still very mysterious. 
What is the cause that, among the thousand 
products of our unconscious activity, some 
are called to pass the threshold, while 
others remain below? Is it a simple chance 
which confers this privilege? Evidently 
not; among all the stimuli of our senses, 
for example, only the most intense fix our 
attention, unless it has been drawn to them 
by other causes. More generally the priv- 


ileged unconscious phenomena, those sus- 
ceptible of becoming conscious, are those 
which, directly or indirectly, affect 


most profoundly our emotional sensibility 


(p. 391). . 

. we reach the following conclusion : 
The useful combinations are precisely the 
most beautiful, I mean those best able to 
charm this special sensibility, that all mathe- 
maticians know, but of which the profane 
are so ignorant as often to be tempted to 


smile at it. 
What happens then? Among the great 
numbers of combinations blindly formed by 


the subliminal self, almost all are without 
interest and without utility; but just for 
that reason they are also without effect 
upon the esthetic sensibility. Consciousness 
will never know them; only certain ones are 

d, consequently, at once use- 


ious, an t 
aod beautiful. They will be capable of 
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ete eee eee 
geometer of which I have j s 
which, once aroused, will call our attention 
to them, and thus give them occasion to 
10us. 
Become oiy a hypothesis, and yet here 
is an observation which may confirm it: 
when a sudden illumination seizes upon the 
mind of the mathematician, it usually hap- 
pens that it does not deceive him, but it 
also sometimes happens, as I have said, 
that it does not stand the test of verification $ 
well, we almost always notice that this 
false idea, had it been true, would have 
gratified our natural feeling for mathe- 
matical elegance. 

Thus it is this special esthetic sensibility 
which plays the role of the delicate sieve 
of which I spoke, and that sufficiently 
explains why the one lacking it will never 
be a real creator. 

Yet all the difficulties have not disap- 
peared. The conscious self is narrowly 
limited, and as for the subliminal self we 
know not its limitations, and this is why 
we are not too reluctant in supposing that 
it has been able in a short time to make 
more different combinations than the whole 
life of a conscious being could encompass. 
Yet these limitations exist. Is it likely 
that it is able to form all the possible com- 
binations, whose number would frighten the 
imagination? Nevertheless that would seem 
necessary, because if it produces only a small 

part of these combinations, and if it makes 
them at random, there would be small chance 
that the good, the one we should choose, 
would be found among them. 

Perhaps we ought to seek the explanation 
in that preliminary period of conscious work 
which always precedes all fruitful uncon- 
scious labor. Permit me a rough compari- 
son. Figure the future elements of our 
combinations as something like the hooked 
atoms of Epicurus. During the complete 
repose of the mind, these atoms are motion- 
less, they are, so to speak, hooked to the 
wall; so this complete rest may be indef- 
nitely prolonged without the atoms Meeting, 
and consequently without any combination 
between them. i j; 

On the other hand, during a period of 
apparent rest and unconscious work, certain 
of them are detached from the wall and put 
in motion. They flash in every direction 
through the space (I was about say the 
room) where they are enclosed, as would, 
for example, a swarm of gnats or, if you 
prefer a more learned comparison, like the 
molecules of gas in the kinematic theory of 
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gases. Then their mutual impacts may pro- 
duce new combinations. 

What is the role of the preliminary con- 
scious work? It is evidently to mobilize 
certain of these atoms, to unhook them from 
the wall and put them in swing. We think 
we have done no good, because we have 
moved these elements a thousand different 
ways in seeking to assemble them, and have 
found no satisfactory aggregate. But, after 
this shaking up imposed upon them by our 
will, these atoms do not return to their 
primitive rest. They freely continue their 
dance. 

Now, our will did not choose them at 
random; it pursued a perfectly determined 
aim. The mobilized atoms are therefore not 
any atoms whatsoever; they are those from 
which we might reasonably expect the de- 
sired solution. Then the mobilized atoms 
undergo impacts which make them enter into 
combinations among themselves or with 
other atoms at rest which they struck against 
in their course, Again I beg pardon, my 
comparison is very rough, but I scarcely 
know how otherwise to make my thought 
understood, 

However it may be, the only combinations 
that have a chance of forming are those 
where at least one of the elements is one of 
those atoms freely chosen by our will. Now, 
it is evidently among these that is found 
what I call the good combination. Perhaps 
this is a way of lessening the paradoxical 
in the original hypothesis. 

Another observation, It never happens 
that the unconscious work gives us the result 
of a somewhat long calculation all made, 
where we have only to apply fixed rules. 
We might think the wholly automatic sub- 
liminal self particularly apt for this sort of 
work, which is in a way exclusively mechan- 
ical. It seems that thinking in the evening 
upon the factors of a multiplication we might 
hope to find the Product ready made upon 
our awakening, or again that an algebraic 
calculation, for example a verification, 
would be made unconsciously. Nothing of 
the sort, as observation proves. All one 
may hope from these inspirations, fruits ©: 
unconscious work, is a point of departure 
for such calculations, As for the calcula- 
tions themselves, they must be made in te 
second period of conscious work, that which 
follows the inspiration, that in which one 
verifies the results of this inspiration a 
deduces their consequences. The rules q 
these calculations are strict and Cora ee 
They require discipline, attention, will, ae 
therefore consciousness. In the subliminal 


) a 
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self, on the contrary, reigns what I should 
call liberty, if we might give this name to 
the simple absence of discipline and to the 
ro fo Poen of chance. Only, this dis- 
order itself permits unexpected combinations 
(pp. 392-394). 


In addition to these pioneers there 
of course have been numerous others 
who in some manner have made a sub- 
stitute trial and error in a modeled or 
mnemonic environment an important 
aspect of their description of thinking. 
In rough chronology these include 
Baldwin (1906), Pillsbury (1910), 
Rignano (1923), Woodworth (1921), 
Woodworth & Schlosberg (1954), 
Thurstone (1924), Tolman (1926), 
Hull (1930), Muenzinger (1938), Mil- 
ler and Dollard (1941), Craik (1943), 
Boring (1950), Humphrey (1951), 
Mowrer (1954), Sluckin (1954), and 
many others. 


OBJECTIONS TO THE MODEL 


The Gestalt Protest 


The trial-and-error theme in learn- 
ing was of course one part of the syn- 
drome of ideas against which Gestalt 
psychology eloquently protested. In 
spite of this, there is judged to be no 
inherent conflict between the perspec- 
tives of this paper and the Gestalt posi- 
tion. To make this interpretation, it is 
necessary to regard neither traditional 
associationism nor Gestalt psychology 
as discrete integrated wholes, but in- 
stead to regard each as congeries of 
which the parts may be separately ac- 
cepted or rejected. 

The Gestaltists are judged to have 
validly rejected Thorndike’s (1898) de- 
scription of animal problem solving as 
solely a matter of overt locomotor trial 
daderror, As this writer (Campbell, 
1956b) has argued previously, recog- 
nizing vision as @ substitute trial-and- 
error process leads to the expectation 
that some locomotor problems will be 
solved by this means, obviating overt 


trial and error. Even more so does 
the model for thought. To the present 
writer the Gestaltists were correct de- 
scriptively even though epistemologi- 
cally the trial-and-error process re- 
mains fundamental to discovery. In 
Wertheimer’s (1959) specific con- 
trasts between insightful problem solv- 
ing and blind trial and error, it is a 
trial and error of overt manipulation 
which is involved. Furthermore, as 
Humphrey (1951) and Woodworth 
and Schlosberg (1954) point out, the 
Gestalt descriptions of problem solving 
provide ample evidence of both for- 
tuitous solutions and misleading 
“insights.” The recurrent Gestaltist 
protest that even the errorful trials are 
“intelligent” and that the subsequent 
trials make use of what was learned 
through the error are taken here as 
equivalent to the statements that the 
problem solver had some valid general 
knowledge to begin with, and that be- 
fore acting he employed the substitute 
trial and error of thought or vision. 
The blind-variation-and-selective-re- 
tention model of thought joins the 
Gestaltists in protest against the pic- 
ture of the learning organisms as a 
passive induction machine accumulat- 
ing contingencies. Instead, an active 
generation and checking of thought- 
trials, hypotheses, or molar responses 
is envisaged. The model at the level 
of thought places essential importance 
upon internalized selective criteria 
against which the thought trials are 
checked. Poincaré’s (1913) esthetic 
criteria and the Gestalt qualities of 
wholeness, symmetry, organized struc- 
ture, and the like can be regarded as 
built-in selective criteria completely 
compatible with the model. Pringle 
(1951) for example, has proposed a 
selective-retention model of central 
nervous system action in which sys- 
tematic temporal patterns provide the 
selective criteria in a resonance process. 
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J s the model here presented 
Son ee nature of the ae mil 
employed. There must often ea = 
and error among possible genera 
principles, or among rational abstrac- 
tions, or field reorganizations, or re- 
centerings. Both the blind variation 
and selective survival model, and 
Gestalt theory emphasize the advantage 
of breaking out of old ruts, and the 
disadvantages of set and rote drill 
(Boring, 1950; Dunker, 1945; Katona, 
1940; Luchins, 1942; Wertheimer, 
1959). Furthermore, the encounter- 
ing in thought of an idea which fits can 
be accompanied at the phenomenal level 
by a joyful “ahaerlebnis” or a Gestalt 
experience of “closure,” and at the 
overt performance level by a sudden 
and stable improvement signifying “in- 
sight.” There is no essential disagree- 
ment here, Nor is the trial-and-error 
model without phenomenological sup- 
port. Note the highly similar testimony 
from the disparate historical citations 
Provided above, especially in the im- 
agery of multitudinous, loosened, agi- 
tated, teeming, colliding, and interlock- 
ing ideas, 
This is not to say that a Gestalt 
psychologist would be happy with the 
blind-variation-and- selective- retention 
description of thought Processes. Nor 
are all aspects of the Gestalt syndrome 
here accepted. While “insight” is ac- 
cepted as a phenomenal counterpart of 
the successful completion of a perhaps 
unconscious blind-variation cycle, its 
status as an explanatory concept is re- 
jected, especially as it connotes “direct” 
ways of knowing. Furthermore, when 
publicized as a part of an ideology of 
creativity, it can reduce creativity 
through giving students a feeling that 
they lack an important gift Possessed 
by some others, a feeling which inhibits 
creative effort and increases depend- 
ence upon authority. Polya (1945, 
1954) has described such an inhibiting 


CAMPBELL 


tradition in the teaching of mathe- 
matics. 


Individual Differences and Genius 


Another prevalent orientation anti- 
thetical in spirit to the blind-variation- 
and-selective-retention model may be 
called the “mystique of the creative 
genius and the creative act.” This is 
related to our deeply rooted tendency 
toward causal perception (e.g., Heider, 
1944), a tendency to see marvelous 
achievements rooted in equally mar- 
velous antecedents. It takes the form 
of the “fallacy of accident” and of 
“post hoc ergo propter hoc” Let a 
dozen equally brilliant men each pro- 
Pose differing guesses about the un- 
known in an area of total ignorance, 
and let the guess of one man prove 
correct. From the blind-variation-and- 
selective-survival model this matching 
of guess and environment would pro- 
vide us with new knowledge about the 
environment but would tell us nothing 
about the greater genius of the one 
man—he just happened to be standing 
where lightning struck. In such a case, 
however, we would ordinarily be 
tempted to look for a subtle and special 
talent on the Part of this lucky man. 

Owever, for the genuinely unantici- 
patable creative act, our “awe” and 
“wonder” should be directed outward, 
at the external world thus revealed, 
rather than directed toward the ante- 
cedents of the discovery. Just as we 
do not impute special “foresight” to a 
Successful mutant allele over an un- 
Successful one, so in many cases of 
discovery, we should not expect mar- 
velous consequents to have had equally 
marvelous antecedents. Similarly, in 
comparing the problem-solving efforts 
of any one person; from the selective 
Survival model it will be futile, in the 
instance of a genuinely innovative 
achievement, to look for special ante- 
cedent conditions not obtaining for 
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blind-alley efforts: just insofar as there 
has been a genuine gain in knowledge, 
the difference between a hit and a 
miss lies in the selective conditions 
thus newly encountered, not in talent 
differences in the generation of the 
trials. 

This is not to deny individual differ- 
ences in creative intellect. Indeed, 
the blind-variation-and-selective-reten- 
tion model of creative thought predicts 
such talent differences along all of the 
parameters of the process. This is to 
emphasize, however, that explanations 
in terms of special antecedents will very 
often be irrelevant, and that the causal- 
interpretative biases of our minds make 
us prone to such over-interpretations, 
to post-hoc-ergo-proper-hoc interpreta- 
tions, deifying the creative genius to 
whom we impute a capacity for direct 
insight instead of mental flounderings 
and blind-alley entrances of the kind 
we are aware typify our own thought 
processes. Ernst Mach (1896) notes 
our nostalgia for the directly-knowing 
genius: “To our humiliation we learn 
that even the greatest men are born 
more for life than for science in the 
extent to which even they are indebted 
to accident” (p. 175). 

What are the ways in which thinkers 
might be expected to differ, according 
to the trial-and-error model? First, 
they may differ in the accuracy and 
detail of their representations of the 
external world, of possible locomotions 
in it or manipulations of its elements, 
and of the selective criteria. Differ- 
ences in this accuracy of representa- 
tion correspond to differences in de- 
gree of information and intelligence. 
Second, thinkers can differ in the 
number and range of variations in 
thought trials produced. The more 
numerous and the more varied such 
trials, the greater the chance of success. 
Bain has emphasized the role of fanati- 
cism or extreme dedication in pro- 


ducing large volumes of such explora- 
tions. Bain, Souriau, Mach, and 
Poincaré have all emphasized the role 
of advance preparation in assembling 
the elements whose blind permutation 
and combination make possible a wide 
range of trials. Many observers have 
emphasized the role of set and famili- 
arity in reducing the range of varia- 
tions, and have recommended ways of 
reducing trial-to-trial stereotypy, as by 
abandoning the problem for awhile, 
going on to other things. Devices 
abound which are designed to increase 
the likelihood that all permutations be 
considered and are used by most of 
us, as in going through the alphabet 
in finding rhymes or puzzle words. 
There are no doubt age differences in 
the rapidity and uninhibited range of 
thought-trial production. The sociology 
of knowledge makes an important con- 
tribution here: persons who have been 
uprooted from traditional cultures, or 
who have been thoroughly exposed to 
two or more cultures, seem to have the 
advantage in the range of hypotheses 
they are apt to consider, and through 
this means, in the frequency of creative 
innovation. Thorstein Veblen (1919) 
has espoused such a theory in his essay 
on the intellectual preeminence of Jews, 
as has Robert Park (1928) in writing 
of the role of “the marginal man” in 
cultural innovation. (See also Seeman, 
1956.) And more generally, it is the 
principle of variation which leads us 
to expect among innovators those of 
personal eccentricity and bizarre be- 
havior. We can also see in this prin- 
ciple the value of those laboratories 
whose social atmospheres allow wide 
ranging exploration with great toler- 
ance for blind alley entrances.  — 
The value of wide ranging variation 
in thought trials is of course vitiated if 
there is not the precise application of 
a selective criterion which weeds out 


the overwhelming bulk of inadequate 
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trials. This editing talent undoubtedly 
differs widely from person to person, 
as Poincaré (1908, 1913) has empha- 
‘sized. With regard to selection cri- 
teria, one further point should be 
made. Much of creative thought is 
opportunistic in the sense of having a 
wide number of selective criteria avail- 
able at all times, against which the 
thought trials are judged. The more 
creative thinker may be able to keep in 
mind more such criteria, and therefore 
increase his likelihood of achieving a 
serendipitous (Cannon, 1945; Merton, 
1949) advance ona problem tangential 
to his initial main line of endeavor 
(eg., Barber & Fox, 1958). Further 
areas of individual differences lie in 
the competence of the retention, cumu- 
lation, and transmisison of the encoun- 
tered solutions, 
It need not be expected that these 
dimensions of talent all go together, 
In organic evolution, the variation 
process of mutation and the preserva- 
tion of gains through genetic rigidity 
are at odds, with an increase in either 
being at the expense of the other, and 
with some degree of compromise being 
optimum. Just so we might expect 
that a very pure measure of innovative 
range in thought and a very pure 
measure of rote memory might be even 
negatively correlated, as Saugstad 
(1952) seems to have found, and 
similarly for innovative Tange and 
selective precision. Such considera- 
tions suggest complementary combina- 
tions of talent in creative teams, al- 
though the uninhibited idea-man and 
the compulsive edit-and-record type 
are notoriously incompatible office 
mates, a atts 
Notice regarding the individual 
differences thus described that while 
they do make creative innovation much 
more likely on the part of some in- 
dividuals than others, they do not place 
the joys of creative innovation beyond 
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the reach of the less gifted. Indeed, 
looking at large populations of think- 
ers, the principles make it likely that 


many important contributions will 
come from the relatively untalented, 
undiligent, and uneducated, even 


though on an average contribution per 
capita basis, they will contribute much 
less, points which Souriau (1881) has 
noted. The intricacy of the tradition 
to which innovation is being added of 
course places limitations in this regard. 


The Enormous Domain of Possible 
Thought-Trials to be Searched 


A final type of objection to the 
blind-variation-and- selective- retention 
model of thought needs to be con- 
sidered. This objection is to the ef- 
fect that the domain of possible 
thought trials is so large that the solu- 
tion of a given problem would take an 
impossibly long time were a search of 
all possibilities to be involved, either 
through a systematic scanning of all 
Possibilities Where these are enumer- 
able, or through a random sampling of 
the universe of possibilities. Time and 
trial estimates thus based can be over- 
whelming, as Kurt Lasswitz’s story 
“The Universal Library” (1958) dra- 
matically illustrates. Other parodies 
of our model occur in literature as 
far back as Swift’s portrait of the 
Academy of Lagado in Gullivers 
Travels (1941, Pp. 166-169). (Ley, 
1958, traces such ideas back to Lully, 
ca. 1200.) Newell, Shaw, and Simon 
(1958a, 1958b) refer in this vein to 
what they call the “British Museum 
Algorithm,” i.e., the possibility of a 
group of trained chimpanzees typing 
at random producing by chance in 
the course of a million years all of 
the books in the British Museum. 
Such parodies seem effectively to re- 
ject the blind-variation-and-selective- 
retention model through a reductio ad 
absurdum. Needless to say, such a 
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rejection is not accepted in the present 
paper. As a matter of fact, it is judged 
to be in the same class as parallel 
objections to the theory of natural 
selection in evolution. Similar features 
in these two instances make the acci- 
dentalist interpretation more accept- 
able. 

1. Neither in organic evolution nor 
in thought are all problems solved, 
nor all possible excellent solutions 
achieved. There is no guarantee of 
omniscience. The knowledge we do 
encounter is achieved against terrific 
odds. (Those advocating heuristi- 
cally-programed problem-solving com- 
puters are careful not to guarantee 
solutions, and this modesty should be 
extended to all models of creative 
thought.) 

2. The tremendous number of non- 
productive thought trials on the part 
of the total intellectual community 
must not be underestimated. Think of 
what a small proportion of thought 
becomes conscious, and of conscious 
thought what a small proportion gets 
uttered, what a still smaller fragment 
gets published, and what a small pro- 
portion of what is published is used by 
the next intellectual generation. There 
is a tremendous wastefulness, slowness, 
and rarity of achievement. 

3. In biological evolution and in 
thought, the number of variations ex- 
plored is greatly reduced by having 
selective criteria imposed at every step. 
Thus mutant variations on nonadaptive 
variations of the previous generation 
are never tested—even though many 
wonderful combinations may be missed 
therefore. Some of the “heuristics” 
currently employed in logic and chess 
playing machines (Newell, Shaw, & 
Simon, 1958a, 1958b) have the similar 
effect of evaluating all next-possible 
moves in terms of immediate criteria, 
and then of exploring further varia- 
tions upon only those stems passing 


the screening of each prior stage. It 
is this strategy of cumulating selected 
outcomes from a blind variation, and 
then exploring further blind variations 
only for this highly select stem, that, 
as R. A. Fisher has pointed out (e.g., 
1954, p. 91) makes the improbable in- 
evitable in organic evolution. This 
strategy is unavoidable for organic 
evolution, but can obviously be relaxed 
in thought processes and in machine 
problem solving. However, the Pan- 
dora’s box of permutations opened up 
by such relaxation can be used to infer 
that, in general, thought trials are 
selected or rejected within one or two 
removes of the established base from 
which they start. In constructing our 
“universal library” we stop work on 
any volume as soon as it is clear that 
it is gibberish. 

4. When we make estimates of the 
number of permutations which would 
have to be culled to obtain a given 
outcome, we often assume that problem 
solving was undertaken with that one 
fixed goal in mind. This overlooks 
the opportunistic, serendipitous course 
of organic evolution and of much of 
creative thinking. The likelihood of 
a productive thought increases with 
the wider variety of reasons one 
has for judging a given outcome 
“interesting.” To neglect this oppor- 
tunistic multipurposedness gives one 
a poor base for estimating the proba- 
bility of encountering the one outcome 
hit upon and recorded. Thus, when 
Newell, Shaw, and Simon's Logic 
Theorist” (1958a, 1958b) sets out to 


rove the 60-odd theorems in a given 
asie of Principia Mathematica, it 
may face a more formidable task than 
did Whitehead and Russell in generat- 


ing them, if, except for the dozen 
classic theorems reproduced, White- 
head and Russell were otherwise free 
to record every deduction they en- 
countered which seemed “interesting 
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or “nontrivial.” Wigglesworth (1955) 

has noted this strategy on the part of 

“pure” scientists, in commenting on the 
relationship between pure and applied 

scientists in wartime : 


In the pure science to which they were ac- 
customed, if they were unable to solve prob- 
lem A they could turn to problem B, and 
while studying this with perhaps small pros- 
pect of success they might suddenly come 
across a clue to the solution of problem C 
(p. 34). 


In presenting their case for adding 
“heuristics” to the program of the 
“Logic Theorist,” Newell, Shaw, and 
Simon have emphasized the inadequacy 
of blind trial and error. So has 
Miller (1959) in advocating the 
heuristic of searching backward from 
the goal. There is, however, no es- 


*Miller (1959, pp. 244-246) is wrong in 
implying that the strategy of working back- 
ward from the goal eliminates the necessity 
of symbolic trial and error in creative 
thinking. His mistake comes from assuming 
that only one path leads into a goal or sub- 
goal. In the spatial locomotion problems 
from which his concrete illustrations come, 
and for the logic Problems used by Newell, 
Shaw, and Simon, the paths into any position 
are not singular, but are instead typically 
as numerous as paths leading out, “A pure 
strategy of working from the goal back to 
the start would thus involye exploring just 
as many permutations as the pure strategy 
of exploring all paths from the start posi- 
tion. However, there is a useful strategy 
available to symbolic trial and error and not 
to overt trial and error, in Working con- 
currently from both ends. This Produces a 
dramatic advantage in the number of permu- 
tations generated, and a smaller but still 
substantial gain in the number of compari- 
sons, In the instance suggested by Miller in 
which each locus branches into 4 alterna- 
tives and in which the start and goal turn 
out to lie 7 stages apart, either of the pure 
strategies would generate 4+ 42+ 434 
44+ 45+ 48+ 47 or 21,844 permutations 
(neglecting the probable achievement „of 
success before exhausting the 47 generation 

of alternatives). For the mixed approach, 
the junction would be encountered at the 
third stage away from each end, or when 
2(4+42 +43) or 168 alternatives had been 
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sential disagreement between their 
point of view and the one offered 
here. By adding heuristics mechanical 
thought processes have indeed been 
made more like those of human beings, 
both in adequacy and type of errors. 
Such innovations have obviated the 
protests of those such as Wisdom 
(1952) and Mays (1956) who, while 
conceding that machines could choose 
good moves at chess or solve logic 
problems, have found the machines 
failing to imitate life just in their 
orderly inspection of all possibilities. 
Newell, Shaw, and Simon recognize 
that a machine which would develop its 
own heuristics would have to do so by 
a trial and error of heuristic principles, 
with no guarantee that any would 
work. They further recognize that 
Possession of an effective heuristic 
represents already achieved general 
knowledge about’ the domain under 
search, and that adding to this gen- 
eral knowledge will be a blind search 


process. (The devices of learning 
and vision and of coding environ- 


mental possibilities for thought-search 


examined. In 

would be 21,845 
is, the start Pos 
alternative 

goal. In th 
comparisons 
Fach 


the pure strategy, there 
comparisons involved, that 
ition and each subsequent 
would be compared with the 
he mixed strategy, many more 
per alternative are required. 
Permutation must be compared with 
each of the current and previous permuta- 
tions on the other stem, which in this in- 
stance amounts to 7,225 comparisons. If 
the comparisons are regarded as equally 
costly as the generating and storing of alter- 
natives, the savings of the mixed approach 
Over either of the pure approaches would 
amount to approximately 1 to 6, a vey 
handsome gain for any heuristic. This eeu 
is larger as the number of branches at am 
stage increases. Advocacy of the heuristie 
of working backwards in what is essentially 
this mixed form is present in Polya gan 
Wisdom (1952), and Newell, Shaw, TR 
Simon (1958a, 1958p). In none of these is 41 
claimed that trial and error is eliminated. 
while all point to the reduction in trials 
which it can achieve. 
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all represent heuristics in this sense.) 
They might also agree that most 
heuristic devices will be limited to the 
specific domain of their discovery, and 
can only be extended to other domains 
on a trial basis. They would probably 
also agree that no problem solving 
process will be “direct.” The dis- 
agreements I have with their ex- 
cellent paper on the processes of cre- 
ative thinking (Newell, Shaw, & 
Simon, 1958b) are thus minor matters 
of emphasis, but may be worth stating 
nonetheless, to further clarify the 
position here advocated. They say, for 
example : 


We have given enough estimates of the sizes 
of the spaces involved ... to cast suspicion 
upon a theory of creativity which places its 
emphasis upon increase in trial and error 


(p. 63). 


The blind-variation-and-selective-reten- 
tion model unequivocally implies that 
ceteris paribus, the greater the heter- 
ogeneity and volume of trials the 
greater the chance of a productive in- 
novation. Doubling the number of 
efforts very nearly doubles the chance 
of a hit, particularly when trials are a 
small part of the total domain and the 
repetitiousness among trials is low. 
But they too recognize unconvention- 
ality and no doubt numerosity as a 
necessary, if not a sufficient condition 
of creativity (1958b, p. 62). What 
they would validly stress, is the very 
frequent tactical advantage of a trial 
and error of general strategic prin- 
ciples over a trial and error involving 
no classificatory effort nor attempt to 
use clues, and, once such general 
heuristics have been discovered, the 
advantages of a hierarchized trial and 
error process. The advantage of such 
a strategy depends upon the ecology, 
of course, but we are in general justi- 
fied in expecting solutions to be non- 
randomly distributed, and to show 


significant contingencies with prior 
clues. Polya (1945, 1954) has, of 
course, been a major source of in- 
spiration for all efforts to introduce 
heuristics into problem solving, and 
for him a trial and error approach is 
a heuristic of fundamental importance. 

Another minor point of disagree- 
ment may be mentioned. In their 
efforts to consider how a “Logic 
Theorist” might be programed to 
learn a general heuristic from hind- 
sight they propose that it keep a 
record of the outcomes of all past 
trials, successful and unsuccessful, in 
order to be able to scan its experience 
for general principles of strategy 
(1958b). Implementing this would 
put a tremendous strain upon memory 
storage, and would introduce a scan- 
ning process as time consuming as the 
original search process which pro- 
duced the record. The strategy of 
organic evolution is to keep a record 
only of what works, even at the ex- 
pense of repeating its errors. ia he gen- 
eral preponderance of wrong tries at 
every level, plus problems of memory 
glut and access, suggests a similar 
strategy for all knowledge processes. 
Heuristics can probably best be learned 
through a trial and error of heuristics, 
tried on new problem sets rather than 
old. 

Sratus as A THEORY 


At the level here developed, one might 
better speak of an “orientation to,” or 
a “perspective on” creative thought 
processes, rather than a theory i 
At many points, this perspective ae y 
points to problems, rather than taking 
that step toward theory of providing 
guesses at answers: €g- for the proc- 
esses here outlined to be possible, 
theory must not only provide several 


memory processes, but most im- 
ortantly, must specify a possible 
p for the trial-and-error 


mechanism 
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search of these. From such specifica- 
tions will come the subtlety of predic- 
tion characteristic of a developed 
theory.” While the perspective even 
in its very general state has some un- 
equivocal empirical implications, the 
major advantage to it may be meta- 
physical, or at least metatheoretical. 
Like the theory of natural selection in 
organic evolution, it provides an under- 
standing of marvelously purposive 
processes without the introduction of 
teleological metaphysics or of pseudo- 
causal processes working backward in 
time. 

Note that there are still ambiguities 
about the status as theory of the well 
established principle of Organic evolu- 
tion through natural selection, even 
though now buttressed with the de- 
tailed genetic model of the variation 
and retention Processes, Scriven 
(1959) has called it an explanatory 
rather than a predictive theory. While 
the present writer feels that Scriven 
has somewhat undervalued the experi- 
mental studies of evolution with 
viruses, bacteria, and insects, and may 


This paper does not attempt to review 
theories in this area. Citations to the im- 
portant contributions of Pringle (1951) and 
Hebb (1949), do not begin to represent this 
literature. Note the special problem of a 
brain analogue to switching. Ashby’'s 
(1952) model and most Computer memory 
search involves a spatial displacement of 
solids impossible in the brain, as in the 
stepping switch or the rotation of a mag- 
netic memory drum. Computer memory 
search processes making use of timed pulses 
require a precision of timing dependent upon 
a stability of dimension Presumably not 
available in the brain and usually if not 
always dependent upon a clock within which 
actual spatial displacement of solids takes 
place. It is for these reasons that Pringle’s 
(1951) theory seems particularly promising, 
and one wonders why it has not been more 
used, or if not usable, more publicly refuted. 
See Pribram (1959) for a recent contribu- 
tion to the problem of appropriate brain 
process models. 
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have mistaken the past absence of ex- 
perimental settings appropriate to test- 
ing the theory for an inherent attribute 
of the type of theory per se; he has 
called attention, nonetheless, to some 
serious problems. Even Sewall 
Wright (1960), whose statistical ge- 
netic theory of evolution has added 
subtle details to the overall description 


of the Process, has commented in a 
similar vein: 


The theory is deterministic only in an ex- 
ceedingly limited sense. It is essentially a 
theory of the conditions favorable for an 
ever continuing process that is essentially un- 
Predictable in its details. There can be no 
formula for serendipity (p. 148). 


The problems which a selective re- 
tention theory of creative thinking 


shares with that of evolution include 
the following : 


1. The basic insight, so useful and so 
thrilling when first encountered, is close to 
being an analytic tautology rather than a 


synthetic description of process: if indeed 
variations 


For most applications of the selective 
t , the variation is taken as a 
descriptive given, as an unexplained part of 
the explanation. While other predictive 
theories likewise depend upon unexplained 
processes at a more molecular level, this 


crucial point, and has j 
a denial of determ 
physic of “spontaneous change” as presump- 
tive as a teleological one. We are currently 
getting detailed deterministic explanations 
of the mutation of genes, but until something 
comparable is available to predict the gen- 
eration of heterogeneous thought trials, this 
constitutes a weakness of the model. A 

3. The biological study of the evolution of 
any species takes the form of a post hoc 
reconstruction of a unique, “undetermined, 
historical Process. The achievement of any 
general regularities must be probabilistic 
in the extreme. Studies starting from spe- 
cific spectacular achievements in creative 
thought must be similar in nature, — 

4. The theory suffers from the multiplicity 
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of possible mediations it posits. Where 
there are gaps in the historical knowledge, 
the theory makes available an embarrassing 
surfeit of possible reconstructions, In this, 
and in contrast to the successful theories 
of macrophysics, the theory is less self- 
disciplining, less specific in its predictions, 
more evasive of potential disconfirmation. 
This is perhaps the most important of 
Scriven’s (1959) points, and one equally ap- 
plicable to the theory of creative thinking. 

5. In the usual applications, the environ- 
ment is not described or describable prior to 
the organismic achievement of adaptation to 
that environment. Whereas the theory deals 
with an iterative process whereby an organ- 
ism adapts to (achieves knowledge of) an 
independent environment, the evidences as 
to the organismic form and environmental 
parameters are often confounded, in that the 
same data series is used to infer both. While 
this is not so for laboratory studies of trial- 
and-error learning, it is particularly apt to 
be so for any study of truly great creative 
thinking in science. (See Campbell, 1959, 
p. 157 for epistemological citations to this 
problem.) 


There is in addition, a serious 
problem which the blind-variation-and- 
selective-retention theory of creative 
thought faces which is not present 
in comparable degree in the modern 
theory of organic evolution. This is 
the unfavorable ratio of hypothesized 
unobservable processes to observable 
input-output variables. Note that even 
in its sketchy form here given, some 
6 to 14 or more separately variable 
parameters are implied. These in- 
clude: (a) A mnemonic representation 
of environment, varying perhaps in 
scope, accuracy, and fineness of detail; 
(b) A mnemonic search or thought- 
trial process, varying in the accuracy 
with which it represents potential overt 
exploration; (c) A thought-trial gen- 
erating and changing process, varying 
in rate, heterogeneity, idiosyncrasy, and 
lack of repetitiousness among succes- 
sive shiught trials; (d) Selective crite- 
ria, varying in their number, accuracy 
of representation of environmental con- 


tingencies, and precision, sharpness, or 


selection ratio; (e) A preservation or 
propagation process, providing a reten- 
tion for selected thought trials of a 
quite different order from the memory 
traces of the nonselected ones, varying 
perhaps in accuracy and accessibility ; 
(f) A reality testing process in which 
the selected thought trials are checked 
out by overt locomotion in the external 
environment, varying perhaps in sensi- 
tivity to disconfirming feed-back. 

This inventory of weaknesses does not, 
of course, represent argument in favor 
of rival theories of creative thought, 
which are judged to be still more 
amorphous, still less adequate. And as 
Duncan (1959) makes clear in review- 
ing the research literature on human 
problem solving, for all theories there 
is lacking a disciplined relation both 
to experimental undertakings and to 
findings. Even in its present form, 
however, the theory contains many 
empirical implications. Manipulation 
of any one of the 14 variables just 
listed should increase the number of 
creative products, providing the other 
variables can be held constant. Pre- 
dictions of this order have been speci- 
fied in the discussion of individual dif- 
ferences. Particularly characteristic 
are the unequivocal predictions re- 
garding the volume and heterogeneity 
of thought trials. Ceteris paribus, a 
creative solution is more likely the 
longer a problem is worked upon, the 
more variable the thought trials, the 
more people working on the problem 
independently, the more heterogeneous 
these people, the less the time pres- 
sure, etc. 

SUMMARY 


This paper has attempted to make 
the psychological and epistemological 
point that all processes leading to ex- 
pansions of knowledge involve a blind- 
variation-and-selective-retention proc- 


ess. Processes, such as vision and 
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thought, substituting for an overt trial 
and error are of couse acknowledged. 
But each of these are interpreted as 
containing in its very workability wis- 
dom about the environment obtained 
originally by the blind variation of 
mutation and natural selection. In 
addition, each contains a blind-varia- 
tion-and-selective-retention process at 
its own level. Supporting the effort 
to interpret all knowledge processes 
in this light has been an emphasis 
upon the tremendous gain in knowl- 
edge in the course of evolution and 
history, a gain which can only be ex- 
plained by a continual breakout from 
the bounds of what was already 
known, a breakout for which blind var- 
iation provides the only mechanism 
available. 

The application of this perspective 
to the process of creative thought 
antedates its application to trial-and- 
error learning in animals and to or- 
ganic evolution, and is illustrated 
through quotations from Bain, Souriau, 
Mach, and Poincaré, The model is 
not in disagreement with the bulk of 
the Gestalt description of problem 
solving, nor the work on heuristically 
programed problem-solving computers. 
However, there is an effort to root 
out a prevailing implicit belief in the 
possibility of “direct” or “insightful” 
creative thought processes. 
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